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Abstract 

Traditional modeling notions presume the existence of a "truth" model that relates 
the input to the output, without advanced knowledge of the input. This has led to the 
evolution of education and research approaches (including the available control and 
robustness theories) that treat the modeling and control design as separate problems. 
This paper explores the subtleties of this presumption that the modeling and control 
problems are separable. A detailed study of the nature of modeling errors is useful to gain 
insight into the limitations of traditional control and identification points of view. 
Modeling errors need not be "small" but simply "appropriate" for control design. 
Furthermore the modeling and control design processes are inevitably iterative in nature. 

Introduction 

It is difficult to know a priori what type of modeling errors will be significant in 
the control design problem, and this issue is typically left to ad hoc approaches with the 
hope that "robust" control design techniques will somehow compensate for any error left 
in the model. However, significant performance improvements are often possible by 
obtaining more appropriate models at the outset, as opposed to placing all the burdens for 
the compensation of modeling errors on the control design. Even in identification 
approaches to modeling, the adequacy of the model for control design is unknown [1]. It 
is useful to revisit the formal structure of modeling errors to point out some common 
misunderstandings about their nature. 

Perfect models are never required nor are they possible to construct. Some 
modeling errors are always acceptable. For example, in most aircraft or spacecraft 
control problems, these errors are probably acceptable without reservation: (i) 
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microscopic effects (molecular motion in the wings), (ii) relativistic effects, (iii) the 
higher order effects of earth oblateness on the environmental disturbances. 

It is common practice in engineering to develop models of dynamic systems 
without regard to the impact that input forces have on the validity of the model. Unlike 
relativistic errors, this is a serious impact that needs further clarification. We shall see 
that the nature of the input forces dictate the validity of the model. If the input forces are 
to be controlled by a feedback controller, this means that the controller design dictates 
the validity of the mathematical model, rather than the other way around. (Control texts 
like to say that the fidelity of the controller is dictated by the fidelity of the model). An 
example clarifies the idea. A plate of steel might be characterized as a rigid body. A 
rigid body model is appropriate if the forces to be applied are relatively small. Large 
forces will deform the plate and can render a rigid body model useless for predicting the 
dynamic response. This argument continues indefinitely. The more accuracy required in 
the predictions of dynamic response, the more careful one has to be about including 
small effects in the dynamic model (non-homogeneous material properties of the steel, 
air turbulence, thermal gradients, etc.). This quickly leads to the conclusion that the 
validity of a model cannot be assessed by its open loop response. The modeling errors 
that are acceptable depend upon the control forces. 

Figure 1 illustrates the steps in the modeling process. The first step is to adopt an 
"idealization" of the system. Two examples will clarify the meaning of "idealization" 1) 
If the physical phenomenon is a mechanical structure, we might "idealize" the structure 
as a rigid body. Another idealization is a flexible plate. The actual structure is neither a 
rigid body nor a flexible plate, but such a hypothesis (e.g. rigid body) forms our 
"idealization" of the structure and this occurs before any mathematical models are yet 
formulated. 2) Another type of idealization occurs in identification experiments. When 
we try to fit a canonical model (e.g. phase variable form) to the input output data, we say 
that the canonical structure forms our "idealization" of the plant (The word plant is used 




Fig. 1 Steps in the Modeling Process 


to include the characterizations of the disturbance environment as well). Hence, 
idealizations lead to errors in model structure and model order. 

The second step in the modeling process is to apply known physical laws to 
develop a mathematical model of the idealization. This step introduces parameter errors 
(values of mass, inertia, spring constants, etc.). This step might be omitted if the model 
is developed from experimental data. 

The third step is the simplification of the model. High order models may be 
reduced to low order models. This step leads to errors in model order and parameters. 

The fourth step is controller design based upon the model obtained in step EL The 
purpose of this paper is to discuss the types of errors that have small and large impacts on 
the controller design. 

The controller design procedures which are tractable are usually of high order. 
(Both LQG and H** controllers are of order generally equal to the model order). Hence, a 
fifth step in the modeling and control design process is controller reduction. (If an 
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optimization is performed to obtain the controller of specified low order directly, the 
controller reduction and/or the model reduction step is omitted). 

This paper is a tutorial on the concepts of model errors in model based control 
design. The paper poses some old questions, some new ones, and adds some conjectures. 
The intent is to clarify some common misunderstandings about the nature of modeling 
errors and their impact in feedback control. 

The paper is organized as follows. Section 2.0 cites simple examples which 
motivate the need for a more precise way to characterize "acceptable" modeling errors. 
Section 3.0 characterizes the structure of all modeling errors, and describes a modeling 
and control inseparability principle. Section 4.0 describes the closed loop impact of 
modeling errors and presents an uncontrollability, unobservability principle. Section 5.0 
offers some conclusions. 

2.0 Motivating Examples and Some Modeling Principles 

This section illustrates four modeling principles by examples. 


MODELING PRINCIPLE I: arbitrarily small modeling errors can lead to arbitrarily 
bad closed-loop performance 

For the system described by the transfer function 


G(s) = 


1 

(l+S)(l+£S) a ’ 


( 1 ) 


let e > 0 be small, possibly representing fact actuator or sensor dynamics. If the fast 
dynamics are ignored (a common approach in practice) then the control design model 
becomes 


G R (s) = 


1 

1+s 


( 2 ) 
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where, for a = 1, it may be shown that the step input errors between G(s) and Gjte) are 
bounded by 


iy(t)-y R (t)i=L-M 


— [G(s)-G r (s)]| 

S 


2ee~ t 

1-e 


t>0 


(3) 


Hence the modeling errors as measured by the open-loop response are arbitrarily small if 
e is arbitrarily small. Suppose an output feedback controller with u = -Ky. If 


K < — , 
4e 


(4) 


then the model Gr(s) is also useful for predicting closed loop behavior, since 


yR(s) 


G r (s) 


— - y(s). The interesting observation about (4) is that it tells that 
s 


1+KGr(s) 

the usefulness of the control design model G R (s) for predicting closed loop behavior 
depends both upon the modeling error (characterized by e) and the controller gain K. 
This illustrates the theme of this entire paper, the modeling and control problems are not 
independent. 

Now suppose the requirements on modeling error and control design (4) are 
violated to the extent that 


(The reader may verify that K - — corresponds to the breakaway point on the root 

locus). Now the model G R (s) and controller (5) predict a closed loop system with no 
overshoot, no oscillations and a small steady state error. However, the actual response 
will yield severely underdamped oscillations for a = 1, and instability for a = 2). 

The point of this example is that arbitrarily small modeling errors do not lead 
necessarily to small errors in the closed loop predictions. The following example 
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illustrates that small modeling errors can lead to large errors in optimality and maximal 
accuracy predictions. 

Consider (1) with a = 2 and the same G R (s) = 1/s+l. Let K be the optimal control 
for the model G R (s) so as to minimize 


V = J [y|(t) + pu 2 (t)]dt = V y + pV u 

O 


( 6 ) 


Then 


K = -1 + Vl+p -1 


(7) 


and the closed-loop system performance of (1) with u = -Ky is described in Fig. 2 in 


terms of V y versus V u as p varies from » — ► 0. 



Fig. 2 Errors in optimal controller (not to scale) 


V u 


For p 
based 


< — the actual closed loop system is unstable, whereas the predicted behavior 
4 

on G r (s) approaches its maximal accuracy, [V y (predicted)] -» 0, as p 0. 
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Hence, in the neighborhood of maximal accuracy predictions for Gr(s) in Fig. 2, the 
actual system delivers its worst performance. 

Now for the system (1), consider the "absurd" design model 


Gr(s) = 


1 


(1+ES) 


( 8 ) 


in lieu of Gr(s) = — . This model is "absurd" to the extent that the open-loop step 
response error is not necessarily small. The optimal control for this realization of the 
plant (A = -— , B = — , C = 1) yields the optimal control u = -Ky, K = -1 + Vl+p“*, 

E £ 

which is precisely the same K as for model Gr(s) = Hence the actual performance 

is the same as that described in Fig. 2, and models G R (s) and G R (s) are equivalent for the 
purposes of control design by (6). There is an infinite number of controllers in this 
comparison (one for each selected value of p). Hence, this equivalence is not just for an 
isolated control based upon models Gr(s) and G R (s). 

The performance predicted by model Gr(s) is shown in Fig. 2 to be arbitrarily far 
from the actual performance for large values of p and for small values of p, although the 
predictions are close to those of model Gr(s) for small p. 

In Fig. 2, the comparison between the "actual" and "predictions from G(s)" indicate 
that for small gains (small V u , large p) the actual performance agrees with the 
performance which would be optimal using the exact model G(s). That is, model errors 
do no damage for small enough control efforts. The actual performance has a "best" 
performance at a particular value of control effort (p = e 2 ) and increasing control efforts 
beyond this point degrades performance leading eventually to instability (p < e 2 /4). All 
controllers will drive a physical system unstable as the control effort is increased enough 
(see the conjecture below). The shaded area in Fig. 2 describes the difference in 
performance between that which would be optimal for the exact plant G(s) and that 
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which results from controllers which are optimal with respect to an erroneous model 
(G r (s) or G r (s) in our example}. This property is generic, according to the conjecture 
below. 

Let S c be a linear dynamic controller which is optimal for the linear model 5 2 . Let 
the actual linear plant be S 0 and its optimal controller be 5 C0 (which we cannot 
construct!). Since mathematical models are always approximations of the physical plant 
S 0 * S 2 . Let V y denote the output norm when S c drives S 0 . Let V yo denote the output 
norm when S M drives S Q . V y takes on its smallest value V y when the control norm has 
value Vy. VyQ takes on its smallest value V yo when V u is arbitrarily large V uo , see Fig. 2. 

Conjecture: 

Controller S c (optimal for model driving plant S 0 always yields an unstable 
closed-loop system in the neighborhood of maximal accuracy predictions using any S 2 . 
Hence, 


lim V y (p) = oo , 

p-*0 7 

(9a) 

lim [V y (p)-V yo (p)]=0 

p-+- 

(9b) 

V >v 

JLy JLyo 

(9c) 

V < V 

_u ^ L.uo 

(9d) 


indicating that the actual maximal accuracy V y = {min V y (actual)} occurs always at a 
lower value of control effort V u than the predicted maximal accuracy V yo = 
{min V y (predicted)} which occurs at a value ofV w . 

This conjecture asserts that real controllers (always based upon erroneous models) 
will always be bad (unstable) for large control effort V u and will always be as good as the 
optimal controller for arbitrarily small control effort (9b). For quadratic criteria, this 
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suggests that a meaningful comparison of candidate controllers should be conducted in 
the neighborhood of the maximal accuracy V yA for each controller. Each controller 
design, (based perhaps upon different reduced order models), will yield a different V yA . 
Furthermore one cannot say a priori which is the best reduced order model (as the above 
examples illustrate). Hence, again we see the dependence between the modeling and 
control problems. 

MODELING PRINCIPLE II: large open-loop modeling errors do not necessarily lead to 
large closed-loop prediction errors 

Most of the available theories on model reduction [1-14] try to achieve "small" 
modeling errors, according to some open-loop criterion for minimization. (Example 1 
may even provide some motivation for this goal). The appreciated virtue of balancing 
and Hankel norm methods [14] is the existence of an upper bound on the size of the 
transfer function error. The purpose of this example is to illustrate that it is not necessary 
to have small modeling errors if the control scheme has the right characteristics 
(remember the theme of the paper). Consider a plant described by 

G(s >=ir (,0) 

and an approximate model 

G R (s)=l/s. (11) 

The controller again is output feedback u = -Ky. Note that the actual system (10) is 
asymptotically stable, whereas the approximation (11) is not. G R (s) would not be 
deemed a good approximation of G(s) by any of the model reduction theories available in 
the literature. It is interesting, however, to ask "in what sense is G R (s) a good model for 
predicting closed-loop performance of the plant?" 
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(i) Stability of G R (s) under the feedback gain K will also yield stability of G(s) 
under this K. 

(ii) Errors in predicting performance of the step response are given by 

i y(t) — y R (0 1 <-j~, t>0. (12) 

Hence the errors in the closed loop response predictions can be made as small as desired 
by choosing an appropriate K, despite the fact that open-loop errors are large. Thus, 
closed loop effects of the modeling errors may be smaller than the (open-loop) modeling 
errors. The conclusion from this example is that the modeling errors should be 
appropriate for the controller design and not necessarily small. (We do not suggest that 
high controller gains are always appropriate). 

To illustrate that large modeling errors can yield even zero closed loop errors, 
consider the following examples. We have already shown that two models of the same 
order, yet not related by a coordinate transformation, can yield the same controller (7). 
This concept is not limited to models of the same order. Consider models for standard 
LQG design: 


x = Ax + Bu + Dw , z = Cx , z = Mx + v 

with zero mean uncorrelated white noises w, v, with intensities W = 4, V = 1. The same 
controller given by 

x c = A c x c + F z , u = Gx c 
A c = -3 , F = 2 , G = -2 , 


Optimizes all three of these models: 
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n> 
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0 

A,= 

— ! 0 
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1 
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1 
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* 
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It 

-1 -1 
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0 
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II 

C4 

Q 

* 

.417 


Ci = [1 0 ] 
M, = [0 1] 


C 2 = [V£ 4 . 079 ] , 
M 2 = [.8, . 5 ] 


(HD 

A3 — — 1 , B3 = .2 , D3 = 1 .5 , C3 — - V3 , M3 — .8 
with respect to the design objective V, 

V =E„(8y 2 +u 2 ), E„ = lim E . 

t— 

That is, the same closed loop value of the performance metric is obtained with all three 
models (Mil). Hence models (I-III) are equivalent models with respect the control 
objective V. The striking observation here is that the open loop responses of the "control 
equivalent" models (I-IV) are not similar nor "close" by any open loop measure, yet all 
three models are equally good for the control design. Fig. 3 shows an open loop impulse 
response of the models. Two of these models were taken from [ 26 ]. Other control 
equivalent models are discussed in [ 27 , 28 ]. 

Let a given system S 0 be controlled by a controller S C (S 1) based upon a model S j. 
Modeling Principles I and II and the above numerical examples lead immediately to the 
conclusion that making errors "small" is extremely difficult and also unnecessary. 
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Fig. 3 Impulse response of three "control equivalent" models 
Modeling Principle m is a natural consequence of these two facts. 

MODELING PRINCIPLE ID: open-loop modeling errors ( and hence their bounds) do 
not generally constitute enough information for successful control design. 

Of course, the previous discussions and examples provide the clue to the missing 
information. Knowledge of inputs are required for any assessment of fidelity of the 
model. This principle is not limited to quadratic criteria for system evaluation. See for 
example from the Nyquist plot of Fig. 4 that many different models of a plant may yield 
controllers with a common gain and phase margin. Hence, models may be "control 
equivalent" by any control design criterion, classical or modem. Such equivalence seems 
to have no direct relationship to the open loop modeling errors. 
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Fig. 4 Three controllers yielding the same gain & phase margins. 

3.0 The Structure of Modeling Errors 

For the sake of discussion only, imagine a mathematical model so accurate as to 
mimic the physical phenomena for all practical purposes. Then in the following 
arguments we refer (by a slight abuse of language in this Section only) to the model 

x -Ax +Bu + w + f(x,u, t) (13a) 

z=Mx + v+ g(r,u,t) (13b) 

as the "physical system" with the control inputs u £/?"*, disturbance inputs w e/? n ’ and 
measurements z eR n \ The terms f(x, u, t), and g(x,ut) represent nonlinearities and 
w (t) represents any time varying disturbances (w (t) is not a function of x or u). Of 
course, to attribute "physical system" status to (13) the dimension of the state x 
approaches infinity. We shall consider it large but finite. Now with respect to any 
mathematical model of the form 
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x = Ax + Bu + w, xeR n ‘ 

(14a) 

z = Mx + V ZER”' 

(14b) 


which might be used to represent the actual system (13), we wish to completely 
characterize the model errors of (14) with respect to (13). For this purpose rewrite (13) 
in the partitioned form 


*R 


Ar a rt 

r y 

*R 


Br 


p * 

W R 


f R (x,u,t) 

x x 
• - 


_ TR ^T. 

x T 

+ 


u 4* 

Wf 

+ 

f T (x,u,t) 


z=[M r M t ] 


xr 

. X T. 


+ v + g(x,u,t) 


where x R eR has the dimension of x. Using the following definitions 


AA = A R - A , AB = S R -B , AM=M r -M, 


The equation for x R and z can be written, from (15), (16) 

x R — (A+AA)x R + A R t*t + (B+AB)u + w R + f R (x , u, t) 
z ss (M+AM)x r + MjXj + v + g(x , u, t) 


(15) 


(16) 


(17) 


or simply. 


i R = Ax r + Bu + w + e x , 
z = Mx r + v + e z 


(18) 


where 
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* ^ 
e x 


AA 


AB 


Art 


Wr-W 


' f R(*R>*T»U,t) 

A 

— 

AM 

*R + 

0 

u + 

M t 

x T + 

v-v 

L J 

+ 

. g(*R4t T ,U,0 


c = e p + e 0 + e d + e n (19) 

We now have the following conclusion from (19). 

MODELING PRINCIPLE IV: For every model of the form (14) there exists "model error 
vectors" t x and e z which represent " corrections " to the state equation (14a) and 
measurement equation (14b), respectively, such that z evolving from (14) matches the 
measurements z from the physical system (13 ) if e x is added to (14a) and t z is added to 
(14b). Furthermore, the model error vectors can always be decomposed into the sum of 
four kinds of errors: parameter errors e p , errors in model order e 0 , neglected 
disturbances e d and nonlinearities e„. 

Note that all four types of modeling errors ep, e 0 , e d , and e n are always present with 
any mathematical characterization of a physical plant We wish also to declare at the 
outset that there exists no control theories which can promise satisfactory control in the 
simultaneous presence of all four categories of modeling error. Techniques are available 
which have made progress only in the accommodation of a subset of these four types of 
modeling errors. 

Certain other conclusions are also obvious from (16). The partitioning of the state 
vector (15) was necessary to define "parameter errors" with respect to a specified model 
of lower order. The matrices A R , B R , and M R result from a partitioning of (13) after (13) 
is written in a selected coordinate frame. It should be clear from (16) that since the 
parameters A R , f? R> and M R depend upon the initial coordinates in (13), the phrase 
"parameter errors" in dynamic systems has no precise meaning. Indeed, the definitions 
(16) are as precise as one can be, yet these definitions are arbitrary to within a coordinate 
transformation on (15). Note that if one chooses a canonical structure for (A, B, M) in 
(16), such as phase variable or Hessenberg form, etc. the parameters of (A R J3 R Mfi) 
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may not have the same structure. Hence, (AA,AB,AM) does not generally have a 
canonical structure. This suggests that one should not expect parameter adaptive control 
schemes to converge if a presumed canonical structure of the parameters has been 
presumed. Also, note from (19) that a change in coordinates which changes the e p term 
will also change the other terms e 0 and e n . Hence, the individual terms e p , e„, e d and e n in 
the model error decomposition (19) are not unique. 

It does not serve our purpose to characterize the model error vector explicitly in 
terms of higher order model states x T . For the reduced model (18) the vector functions 
e x and e z can be considered functions only of r R and u and t. Eq. (18) shows that e 0 is a 
function of Xj, but x T is in turn a function of x R , u, and t, as the solution of the second 
equation in (15) reveals 

t 

Xj(x R ,u,t) = Oj(t,0)x -j-(0)+JOj(t— o)(A tro: r +5 i-u+w-p4-fj-(r R ,x -p,u,CT))dcy (20) 


where Oj is the state transition matrix for A T . Hence, for (18) we may consider e x to 
depend only on (x R , u, t), and when we need to do so we shall write e x (x R ,u,t). In fact, 
using (20) the model order error e 0 of (19) may be characterized by 


e o = 


Art 

Mj 


[<& r (t,0)x T (0)]+ 

__ t 


Art 

Mj 


[Jd>r(t-a)wT(o)da]+ 


Art 

Mj 


[JOr(t— a)A trx R da] 


*Ot 


Cod 


“'OX 


Art 

Mj 


[Jd>r(t — o)B T uda]+ 


J O 


/lRT 

M t 


[ Jdyt-a)^ R ^ T . u .^)da] 


( 21 ) 


“OU 


"on 


From (19) and (4) conclude that e may be written 
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c = {e px +e ox } + {Cpu+CoJ + {e^+c^j+Cd) + {Cn+c^} . (22) 

With respect to the reduced model (14) the first bracketed term in (22) denotes errors 
which depend on the state x R (e px is a linear function of x R , while e ox is an integral 
operator on x R ), the second bracket denotes errors which depend on the control u-e^ is a 
linear function of x R , while e ou is an integral operator on u, the third bracket in (22) 
denotes errors which depend only on time, and finally the last bracket in (22) denotes the 
errors due to nonlinearities. 

It is important to note that the model error vector e x in (18) depends upon the 
integral of the input u. Hence, one cannot assess the impact of modeling errors e x 
without knowledge of the nature of the controls. Even small inputs u can have an 
arbitrarily large effect in e x (t). To see this let Aj be a positive scalar and u(t) = e = 
constant. Then e ou (t) = (A RT 5 T e/A T )(e A ' rt -l) gets arbitrarily large in this example. 
Hence, the homogeneous part of the system may be modeled arbitrarily closely 
(e p arbitrarily small or zero) and yet the model may not be acceptable for control design 
( Modeling Principle I). 


The Model Error System 

The model error structure (21) can be further detailed. See that e ox satisfies the 
differential equations 


i4 RT 



^TR*R + 



t 

J A jO^t— o)A trx R da 

O 


(23) 


and the definition 


“'ix 


4 



t 

j Aj <X>x(t— crM R da 

o 


(24) 


allows (23) to be written 
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®o* = 


^RT 
Mj 


^TR*R + C lx 


and likewise for e lx and e. 


IX 


c lx = 


Art 

M t 


A trX r + e2 X 


e« = 


Art 

M t 


ATATRXR + e ( i + i) X . i = 0, 1, 


Similarly for e ou in (21) 


e ou ” 


Art 


5 x u + e lu 


e iu = 


•^RT 


A X B T u + e^ +1 ) u i = 0, 1 , .... 


(25) 


(26) 


(27) 


The deleted state Xj, and hence A x , is usually quite large so that the range of i is a large 
finite number. 

Equations (26), (27) and (22) are now combined with (17) to give the exact 


structure of the model error system 

x R = Ax r + Bu + w + B 10 [(e ox +e ou )+(e px +e pu )+e t +e N ] (28a) 

Cjx = P»^r ®(i+i)x i - 0, 1 , ...» . (28b) 

®iu = Qi^ ^ ®(i+l)u i = 0 , 1, ...» . (28c) 

z = Mx r + V + E 01 [(e ox 4e ou ) + (ep^jJ + e t + e N ] (28d) 

where E 10 - [1 0], E 01 - [0 1], 
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e t “ e ot + c od "*■ c d » e N “ c n + c o 


using the definitions of e ot , e d , e n , e on in (19), (21). 
In state form (28) becomes 



(29) 


(30) 



\ 


20 


Note that the matrices which are unknown are Pj, Qj, AA, AB. Knowledge of the model 
error structure (31) might be very useful in analysis (predictions of performance) and 
control design. Most adaptive approaches to control ignore the e^, e iu i = 0, 1, • • • terms 
in the model error system (31). It might be useful to research the inclusion of one extra 
term. 

The transfer functions of (28) are developed as follows. In the context of linear 
systems, we shall ignore e N . Take the Laplace transform of (28) and see that 

z(s) = (M+AM+E ol (£p/s 5+1 ))[(sI-A)-AA-E 1 o(ZP i /s 5+1 )] _1 [(B+AB+ 
i=0 i=0 


Eio(£Oi/s i ))u(s)+E 10 e t +w] + E ol (£o i /s ifl )u(s)+v(s)+E 1 oe t (s) 

i=0 i=0 


To simplify this expression define 


p . M 

' ix 




AA when i = 0 
EioPj-i wheni>0 




AM when i = 0 
E 0 iP}_i when i > 0 



AB when i = 0 , A 
EioQw when i > 0 ®’ a ~ 

to 

<D(s) = (sI-A)" 1 


then 


z(s)={[M+£P i2 /s i ][<D- 1 (s)-IP ix /s i r 1 [B+2:!2ix/s i ] 

i=0, i^), i=0. 


+ SGi/s 5 } u(s) + e t (s) 


(32a) 


(32b) 


( 33 ) 


where e t (t) represents the effects of only rime -dependent terms. 
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e,(s) = [M+2/ , i^s 5 ][<l>" 1 (s)-I/ , i X /sT 1 [w(s)+E 1 oe t (s)] 
i=0 i=0 

+ E ol e t (s) + v(s) . 

Let the inverse of the sum of two matrices be written 


[O-i+vj/r 1 = «D£ [(-lyOFOy] . 
j=o. 

This can be verified by writing [C>“ 1 +'P]x = y as <J> -1 x = (y-'Ex) or 

x = <D(y-'Fx) 

= Oy-<D'P<D(y-'Fx) 

= etc. 


Now let 


^2 “ Z^ 'M Z^ix/S 1 . 'MzsiX 

i=0, i=0, i=0, i=0 

j(s) — K-iW(sms) 
j=l. 

Then 


z(s) = {[M+'? 2 (s)]<D(s)[I+J(s)][B+'F3(s)]+'F4(s)}u(s>4-e t (s) 

= [G(s)+AG(s)]u(s) + e t (s) (34) 

expressed in terms of the (known) transfer function G(s) = MO(s)B, where 

AG(s^'F 4 (s)+MO(s)^3(s)+[M<I>(s)J(s)+y2(s) < I > (s)+'P2(s) < I>(s)J(s)][B+T3(s)] 


<? t (s) = [M+'F 2 (s)]O(s)[I+J(s)][w(s)+E 10 e t (s)]+E ol e t (s)+v( S ) 


(35) 
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Both expressions (31) and (35) simplify greatly by special choices of coordinates 
of (15). Without loss of generality one can take AM = 0 and Mj= 0 if dim z S dim ;t R . 
To see this, note that a similarity transformation on the state x = Trj can always take the 
measurement z = Mx to z = MTn = [I0]r| if rank M = dim z £ dim x . Of course, we 
cannot construct this T (since we don’t know M), but we know that it exists. Therefore 
by assuming that the states of our model are not fewer than the number of measurements, 
we can, from (29, 32) without loss of generality set 

E o iPi = 0, E ol Qi = 0. 

Hence, from (32), Q a - 0, and = 0 for all i. This simplifies (35) to 
AG(s) = MO(s)'F 3 (s) + [M+'F 2 (s)]0(s)J(s)[B+'F 3 (s)] . 

e t (s) = M<D(s)[I+J(s)][w(s)+E 10 e t (s)] + v(s) (36) 

The state equations (31) also simplify in an obvious way when AM = 0. The term 
e t in (33) and (36) represents unknown time varying excitations arising from a 
combination of both external disturbances and errors of model order. When AG(s) is 
assumed zero, some attention has been paid in the literature [16] to the determination of 
upper bounds on e t which can be tolerated before losing stability. In these studies, the 
AG(s) term in (34) and (36) has been ignored in the control design. See that u(s) 
multiplies AG(s). Hence, it should be emphasized that the effects of model error e x in 
(18) cannot be assessed independently of the control law u(*), and this is the fundamental 
pitfall that prevents the modeling problems and the control problems from being 
separable. 

It is also common to analyze the effects of AG(s), but ignore the effects of e t (s) in 
(34),(36) [17-18]. However, these terms both coexist in the presence of errors in model 
order. Imagine the physical process z(s) = [G(s)+AG(s)]u(s) + e t (s) and a model G(s) 
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which might be used for design of the controller H(s). See Fig. 5a. Control theory has 
reached a sophisticated level of maturity assuming that the model G(s) + AG(s) is 
specified a priori, or, if not given, then the assumption is that the model is something 
that exists in an absolute sense irrespective of the control policy (which is yet to be 
developed). The thesis of this paper is that the model and the control policy must be 
developed together and that no meaning (in an absolute sense) can be attached to either 
one in isolation. These concepts do not readily lend themselves to established definitions 
of robustness, or to certain identification concepts. 

In the time domain characterization of the model errors, the parameters P if Qj in 
(29)-(31) are key. In the frequency domain characterization of model errors, the 
spectrum AG(jco) and e t (jco) in (36) are key. Which is easier to approximate or 
parameterize in practice? We cannot say. H“ methods rely on a characterization of 
AG(jco), e t (jco), [17, 18, 20, 21, 25). This is never possible exactly, but neither is an exact 
determination of Pj, Qj in (31). One available method does not need either the frequency 
or time domain characterization of modeling error, but only needs a geometric condition 
about the space of the model error vector e x in (18). If e x lies in the column space of B, 
(a "matching condition") then only an upperbound on e x is needed and not its spectral 
content [22]. The matching conditions are easily violated in problems with errors of 
model order. Yedavalli [23] shows bounds on each element of the A matrix 
perturbations which preserve stability. 

We cannot associate gain and phase margins with tolerance of any one of the four 
categories of model error in (19), although errors in model order (xj * 0) will certainly 
modify the phase of the system. It can be said (without any sort of precision), that larger 
phase margins may allow the design to be less sensitive to errors in model order. Gain 
margins on the other hand do not necessarily provide tolerance to either parameter error 
or model order error. Refer to Fig. 5 and AG($) to see that the Nyquist test for stability is 
satisfied if over all frequencies to, the full length of the vector l+HG(jco) is larger than 
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Block Diagram of Perturbed System 

Fig. 5a 



Nyqulst Plot of Ptrturbsd Systom 


Fig. 5fc 
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the length of the vector HAG(jco), since the -1 point cannot be encircled in this case. 
Hence, a sufficient condition for stability is 

l+HG(jco) > HAG(jco) for all co 

or 

1 > [l+HG(ja>)r 1 [HAGO‘co)] , for all co (47a) 

This condition may be extended to the matrix case as follows, [11, 14, 15, 18), 

1 > max X { [I+HG(jco)] -1 HAGO'co) } . (47b) 

CO 

where X{*} denotes the square root of the largest eigenvalue of matrix {•}*{•}. Note, 
however, that this can be an extremely conservative condition, since it is possible for the 
vector HAG(jco) to be much longer than l+HG(jco) without causing an encirclement See 
Fig. 5b for a stable situation which violates (47). However, a more fundamental 
limitation of these results is due to the fact that Fig. 5a does not describe most physical 
situations if the e t (s) term in (35), (36) has been ignored. The significant term e t is 
composed of the same source of errors which make up AG(s), (See that J(s) is a model 
order error term which appears in both eqns. of (35). Hence, the above stability results 
are extremely conservative and they ignore e t (s). Therefore they do not readily extend to 
include performance guarantees. Stability is the most studied subject in control, but 
stability is usually not sufficient for successful operation. 

CRITICISM OF THE "MODELING PROBLEM" 

The traditional idea of the "modeling problem" is as follows: 
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The Modeling Problem: 

Find a set (or a class) of differential equations Si describing the dynamical 
relationships between the response z(t) and the (unspecified) inputs {controls u(t), 
disturbances w(t), and initial conditions}. 

The flaw in this task statement is the presumption that there exists a set of 
differential equations which relate z (t) and u(t), irrespective of u(t). We argue that any 
set of differential equations S x is only an approximation of the physical phenomenon S 0 , 
and it was shown in (28) that the errors associated with this approximation cannot be 
assessed, qualitatively or quantitatively, independently of u(t). In other words, 
knowledge of the control inputs u(t) are required in any assessment of model fidelity. In 
modeling and identification literature it is common to talk about model errors with 
respect to a truth model, 5. There is no truth model (5j *S 0 ). The model and its 
controller should be discussed as a pair. They have no significance separately. 

CRITICISM OF THE "CONTROL PROBLEM" 

The traditional idea of the "control problem" is as follows: 

The Control Problem: 

Given the set of models £ which describe the dynamical process, find an 
appropriate control u(t) or controller u(z (t), t) to meet a specified set of control 

objectives. 

The flaw in this task statement is the presumption that the class of models £ which 
appropriately describe the process exists independently of knowledge of u(t), or 
consequently, of knowledge of the controller generating u(t). Now the control law cannot 
logically be specified prior to model development. Thus, if one wishes to squeeze the 
best possible performance from the controller design, then one cannot ignore the 
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MODELING PRINCIPLE V: 

The modeling and control problems are not separable and are necessarily 
iterative. 

This means that for any given G(s) + AG(s), in Fig. 5 the development of a G(s) 
and H(s) is an iterative process. 

Several implicadons of the modeling and control inseparability principle are: 

(a) The phrase "model of the plant" is a misnomer. We must refer to a model as being 
appropriate under the influence of a particular controller. Hence, we must refer to 
a (model, controller) pair as appropriate or inappropriate for each other, with 
respect to a given plant. 

(b) Only "local" properties can be stated concerning the model and the controller. 
This means that the interpretation of both classical and modem control theory must 
be tempered with this knowledge, since parameters of neither plant model nor 
controller can be taken to infinity (or wide ranges). Three examples follow. 

(bl) The Root Locus theory presumes a fixed plant while the controller gain goes to 
infinity. But the fidelity of the plant model depends upon the control gain. Hence, 
the same model of the plant is not appropriate at both the vicinity of the open loop 
poles and the open loop zeros, and a given root locus plot is never reliable in the 
vicinity of both the open loop poles and the open loop zeros. 

(b2) The Nyquist plots are reliable only over a limited frequency range and certainly 
not reliable in the vicinity of the origin, where co -» «>. If this region of uncertainty 
extends to a unit radius around the origin, then even the stability results of the 
Nyquist plot are suspect. See Fig. 6. 

(b3) In LQG theory it is presumed that the model is fixed and that the weights in the 
performance index may be varied over wide ranges. This generates the theoretical 
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predictions of maximal accuracy in Fig. 7 (solid curve). However, the actual 
performance follows the dotted curve, due to modeling errors (note the decreasing 
effect of modeling errors as the control effort decreases). Thus, the deviation of 
the actual performance from the theoretical is greatest where maximal accuracy 
predictions are made. Hence there is often a large discrepancy between achieved 
and predicted maximal accuracy, and large errors in the value of the control effort 
at which maximal accuracy occurs. These inequalities always hold, from (9c, 9d). 

u* < u t 2 , y 2 > y 2 • 

The class of all stabilizing controllers for a given plant model is described in 
[25] and its references. The advantage of this knowledge is that the total design 
freedom is characterized for accomplishing perfonnance beyond stability. 
However, the given model might not accurately describe the plant over "all 
stabilizing controllers (for the given model),” due to MODELING PRINCIPLE V. 
Hence, the "robust controller" based upon the given model might actually 
destabilize the actual system due to the fact that when the observer based controller 
is far away from its nominal design (for the given model), the model that 
accurately portrays the plant is not close to the given plant model. Much more 
work is required to capture the class of observer based controllers which allow 
appropriate changes in the model as a function of the controller. 

A "three-model" control theory would serve to keep control designs honest: 
Model 1: an Evaluation Model used to simulate the real system (until prototype 
testing is available); Model 2: a high order model for anlytical predictions of 
controlled performance; and Model 3: a low order model for control design. 

"Honesty" is maintained by maintaining distinction between the three 
models. Model 2 can never be equal to model 1 due to Modeling Principle IV. 
Model 3 must change as a function of controller design due to Modeling Principle 
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V. Dangerous (faulty) precdons occur whenever any two of these models are 
coincident, for in such cases control design mathematics can easily use high gain 
controllers to achieve a high level of performance without paying the (inevitable) 
penalty for modeling errors, see conjecture (9). See the "Controller data base" S 0 , 
Sj, S 2 , in Fig 6a. 

4.0 The Structure of Errors in The Closed-Loop System 

Suppose our hypothetical "physical system" (1) is now driven by the linear 
dynamical controller with transfer matrix H(s) 

u(s) = H(s)z (s) (37) 

Without loss of generality we may associate a state space realization with (37) and write 

H(s) = G(sI-A c )“ 1 F (38) 


or in state form 


x^A^ + Fz, x c eR Tlt 

» = Gx e . ueS* (39) 

In this Section we ask how the poles of the closed loop system behave as either G or F 
approaches zero. Next arbitrary gains G and F are considered but with a restrictive 
assumption about parameter errors (e p = 0). It will prove convenient to write results in 
terms of a set of nonzero auxiliary matrices (A, B, M) which can always be found 
satisfying 

A + BG - FM = A c (40) 


for any given A c , G, F. 
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Fig. 6a Controller Data Base 



Fig. 6b Nyquist Diagram 



Fig. 7 Performance Evaluation (with "Truth" model) 
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The system (13) is now driven by controller (21). Subtract (39) from (17). This 
gives a differential equation for x = x R -x c . Now writing (13), (39) in terms of the 
states x and x yields the homogeneous part of the closed loop system 


* * 
X 


A +BGEjo 

-5G 

* ^ 
X 

A 


(A rt -FM t )Eo, + (AA-FAM+ABG)E J 

I A - FM - ABG 

X 

Vi / 


1* ^ 
y 


C 

|0_' 

* ^ 
X 

u 

h J 

” 

GEio 

|-G 

X 


where E 10 = [I^ 0] and % = [0* T). 

In the limit as G — » 0, the eigenvalues of (41) become those of the block diagonal 
matrices A and A - FM. This conclusion is summarized as follows. 


Theorem 1: In the limit os G — » 0, any linear system (13) driven by any controller of the 
form (38), has the eigenvalues of A and (A - FM). 

Theorem 1 suggests that the low gain controller (characterized by small G) is 
stable if the open-loop system is stable and if the matrix (A - FM) (which is entirely 
under the design of the analyst) is stable. 

Now multiply (39) by E^), then subtract this equation from (13). This defines the 
vector X = x-Ej^. The homogeneous pan of the closed loop system (13), (39) can now 
be described in terms of states x and x c , yielding 


x 


1 

1 

1 

E 1 t 0 (AA+ABG-FAM) + EjjCAjR+BjG) 

L FM 

f A + BG + FAM 



( 42 ) 


* 

y 


C\CeJq 

r ■* 

X 

u 

L J 


o 

o 
1 
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In the limit F -+ 0 the eigenvalues of (42) become those of the block diagonal 
matrices A and A + BG. This conclusion is summarized as follows. 

Theorem 2: In the limit as F — ► 0, any linear system (13) driven by any linear controller 
of the form (39) has the eigenvalues of A and (A + BG). 

Again, the reader is reminded that A + BG is under the design of the analyst. 

For further insight into the effects of modeling errors suppose that the gains G and 
F are not small, but the parameter errors are zero (AA = 0, AB = 0, AM = 0). Then the 
following is true. 

Theorem 3: In the absence of parameter errors (AA = 0, AB = 0, AM = 0), the closed- 
loop system eigenvalues are those of (A-FM), A T , and (A + BG), if Xj is either 
uncontrollable or unmeasurable. 


Proof: Letting (AA = 0, AB = 0, AM = 0) in (42) yields 


A — eJqFM I Ejj (A tr+B jG) 


FM 


A + BG 


x 

*c 


P n 

y 


CiCEjS] 

* < 
X 

u 

l J 


1 

o 

O 

A 


(43) 


The block diagram of the homogeneous part of the closed-loop system using the notation 
of (15) and (39) is given in Fig. 8. Using the definitions of E 10 and E^, (43) is further 
expanded as follows. 


t 
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*'r 


Ar“FM r 

A — FA/yj 0 


*R 



A X r 

A ] A + B 


*T 

d 


FMr 1 

¥My * A + BG 


. X c. 


(44) 


y 

u 


Cr 

CtJCr 

.0 

0 ' G 


*"r 

*T 



We shall first let x T in (15) be uncontrollable. This is equivalent to the statement that 
(A TR » 0, Bj - 0) in (15) and in Fig. 8. (Recall the controllable canonical form). This 
makes the upper right matrix of (44) zero (using the dotted line partitions). In this event 
the eigenvalues of (44) become those of (A+BG) and those of the upper left block matrix, 
(using again the dotted line partitions). But since A XR = 0 this upper left matrix is now 
also block diagonal, and therefore has the eigenvalues of (A R -FAf R ) and A x . In the 
absence of parameter error A R = A, A/ R = M and the first part of the theorem is proved. 

For the second part of the proof let Xj in (15) be unmeasurable (unobservable from 
the measurement z). This is equivalent to the statement that (A RX = 0, Af T = 0) in (15) 
and in Fig. 8. (Recall the observable canonical form). This makes the upper right matrix 
of (44) zero using the solid line partitions. In this event the eigenvalues of (44) become 
those of (A r -FM r ), and those of the lower right partition, again using the solid line 
partitions. But since Mj * 0 this lower right matrix is now also block diagonal, and 
therefore has the eigenvalues of A x and (A+BG). Now in the absence of parameter 
errors A R = A R , A/ R * M. This gives the same set of eigenvalues (of (A-FM), A T , and 
(A+BG)) as in the first part of the proof. Hence, the theorem is proved. # 

A clear discussion of the concepts in Theorem 3 first appeared in [29] using modal 
coordinates and the phrases "observation spillover" and "control spillover", (signals "a" 
and "b” in Fig. 8 respectively). While such phrases are descriptive, they disguise the fact 
that the "controllability" and "observability" of the states they x T are the important 
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concepts. Hence, we prefer the standard "controllability, observability" description of 
the phenomenon to the phrase "spillover". 

Theorems 1 - 3 suggest that it is important that the matrices {(A-FM), (A+BG), 
A t ) be asymptotically stable, and more significantly, that this conclusion remain 
independent of the methods by which the controller parameters (A e , F, G) were derived! 
(Notice that the optimal LQG controller stabilizes (A-FM), (A+BG)). Suppose the 
controller (39) was designed based upon the assumed model (A R , A/ R ,B R ) so that this 
model had certain desired behavior in the closed loop. Fig. 8 makes it clear that, in the 
absence of parameter errors, the fundamental cause of deviation in system behavior from 
predicted behavior is the relative degree of controllability and observability of the states 
.x x . While controllability and obsen'ability prove to be of great benefit for control in the 
absence of model errors, complete controllability and observabilty would be a serious 
handicap in the real world. Control designers are indeed fortunate that most of the 
"deleted states" associated with "real world x T " are uncontrollable and unobservable. 
Otherwise there would be even fewer successful control designs to celebrate in practice. 

Modeling Principle VI: (Uncontrollability and Unobservability of Dynamic Systems) 

Using any number of sensors and actuators, the physical plant will not be 
completely observable nor controllable. 

This proposition requires some explanation since observability and controllability 
are mathematical properties associated with a mathematical model, whereas the "physical 
plant" defies exact description by any mathematical model. Suppose one improves a 
given mathematical representation of the physical plant by adding additional dynamics 
which were originally ignored in the model. As one continues this process, adding more 
and more details so that the new model more accurately models the physical plant, the 
mathematical model eventually becomes both uncontrollable and unobservable. In other 
words, an uncontrollable, unobservable model can always be constructed to provide a 
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closer representation of the physical plant than any controllable, observable model. It 
takes little convincing to see that this argument is correct. Taken to an extreme it is 
perhaps obvious that the molecular motions in an aircraft wing or the seat cushion 
vibrations in the aircraft cannot be controlled by aileron actions nor observed by rate 
gyros. However, one need not resort to such extreme examples, using infinitesimal 
effects. Note that the example in Fig. 9 describes an uncontrollable system, and the 
uncontrollable part is a nontrivial part of the system dynamics. The control is the torque 
applied to the reaction wheel. 

Recall that if (A,B) is controllable [or (A,C) is observable] then A is 
asymptotically stable if and only if there exists a positive definite solution to 
0 = XA*+AX+BB* [or to 0 = KA+A*K+C*C]. The important impact of the lack of 
observability or controllability is that the stability proofs which rely on observability or 
controllability cannot be used to assure that the physical plant will be stable. Indeed, 
stability is a mathematical concept relating to a mathematical model, and hence the 
physical plant can never be proved stable by mathematics. (Due to the fact that the 
physical plant defies exact mathematical description, and the amount of modeling 
precision required to predict stability is not known a priori). We can only say that the 
model is stable subject to a given range of parameter values, or a given magnitude of the 
model error vector e x , etc. 

Modeling Principle VI seems to be at odds with the notion of generic 
controllability, observability [24]. This result [24] states correctly that adding arbitrarily 
small numbers to every element of (A,B,C) will make the matrix triple controllable and 
observable. Hence, the notion of generic controllability, observability suggests that 
physical systems (such as Fig. 9) are arbitrarily "close" to observability controllability. 
This is consistent with the view of at least half a dozen control texts which state 
emphatically that all real world plants are controllable and observable. But the addition 
of arbitrarily small numbers in every element of (A, B, C) is a mathematical exercise that 
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has no basis in the physics of the problem and cannot describe physical behavior. For the 
physical system in Fig. 9, a control which can simultaneously regulate both 0(t) and \jr(t) 
to arbitrary values would contradict the conservation of angular momentum. Hence, we 
can safely claim that this system is arbitrarily "far" away from controllability rather than 
arbitrarily "close" as the notion of generic controllability suggests. Rather, we should 
interpret the generic controllability result as good reason not to believe computer 
calculations of controllability, observability, since roundoff errors are the equivalent of 
adding small errors to every element of (A, B, C), as in the thesis of generic 
controllability, observability. A common reaction of those that insist that physical 
systems are state controllable is "Who would design an uncontrollable system?" 
Actually, it is impossible to do otherwise. The physical system will always be 
uncontrollable (and that is good news not bad news), but we should not discard the 
uncontrollable part in the control design process. This is verified as follows. 

It can be shown that the stable, uncontrollable, observable system 
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with the control law 


u = GjXj + G 2 x 2 


where Gj is the standard state feedback gain for optimizing the plant (A n , Bj, Cj) and 
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G 2 satisfying 0 = KA 22 + [A n + BiGJ^K + K n A 12 , G 2 = -R -1 B T K depends upon (A 22 , 
A n , B lt Gj) and is not zero unless x 2 is unobservable in y. Hence, the often heard 
argument "the uncontrollable part of the system should be deleted since we cannot 
control it" is faulty logic. The optimal controller (or any other reasonable controller) will 
alter its control of Xj (the controllable part) with the knowledge of the dynamical 
interactions between Xj and x 2 , even though x 2 itself cannot be controlled. For example, 
the minimal energy optimal control that pushes x x toward a certain desired value Xj 
might be zero if the dynamics between xj and x 2 are such that Xj is naturally (without 
control action) driven toward Xj. If x 2 were deleted a priori the optimal control would be 
greater than zero, hence not optimal for the complete system. 

To illustrate the relationship between controllability observability and stability 
consider the second order system in modal coordinates. 
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Note that mode x T is observable (controllable) if and only if A/ T *(B T *0). Now 
consider any first order controller 

u(s) = H(s)z(s) 


where 


H(s) = G(sI-A c ) _1 F 

and where we have chosen to describe A c in the form 

A c =A r + 2? r G-FA/ r 


where G is chosen so that 
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Ar + B r G < 0 


and F is chosen so that 


A r — FA/ r < 0 . 

The necessary and sufficient closed loop stability requires the product of controllability 
and observability of Xj to be limited by 

IA/tB-j-I < IAr-FA/rI 1/4. t I I^r+BrGI (46) 

This simple third order example provides important insight into the necessary and 
sufficient condition for stability. The closed loop will be stable if the neglected part is 
unobservable (A/ T = 0) or uncontrollable (B T = 0), as promised by the sufficient 
condition, Theorem 3. But (46) also shows the upper bound on IA/ T B T I which will 
allow stability even if A/ T , B T are not zero. Hence, the right hand side of (46) is a 
measure of the conservatism in Theorem 3, and this measure is a function of both the 
modeling errors (A T ) and the control gains (G, F). 

Control Design Considerations: (Trading Stability and Performance) 

Practical control designs always require some iteration or some fine tuning during 
experimentation with the real hardware. There are two fundamentally different strategies 
for these two phases of design. These strategies differ by the manner in which they 
handle stability and performance concerns. 

The Stability Design Scenario: 

Design phase A: Design for stability 
Design phase B: Tune for performance. 


41 


The Performance Design Scenario: 

Design Phase A: Design for performance 
Design Phase B: Tune for stability 

"Stability Design" includes (i) guaranteeing specified gain and phase margins by 
root locus. Bode, Nyquist, (ii) designing by pole or pole region assignment, (iii) 
designing by Liapunov techniques, (iv) designing by H°* techniques, etc. "Performance 
Design" includes satisfying input or output variance (or Lj) bounds, perhaps by optimal 
control. 

Phase B is intended to be a simple gain change. In the "Stability Design" scenario 
the gain is usually tuned up a bit to improve performance without destroying stability. In 
the "Performance Design" scenario the gain is usually turned down a bit to improve 
stability margins while maintaining acceptable performance. 

Of course, the design tools have been oversimplified in these scenarios, in order to 
draw distinctions among points of view. There are circumstances where each scenario is 
preferred. In some situations in the Stability Design scenario, adequate performance may 
be impossible to achieve by a simple gain change in Phase B. In this case the 
Performance Design scenario is preferred. In some situations with the Performance 
Design scenario, adequate stability margins may be impossible to achieve by a simple 
gain change in Phase B. In this case the Stability Design scenario is preferred. An 
overwhelming proportion of control literature has focused on stability to support the 
Stability Design scenario. However, as society demands more performance in modem 
systems, the development of practical and theoretical tools which focus more on 
performance will strengthen the Performance Design scenario. 

Stability and performance are competing partners in the design process. They 
usually do not naturally cooperate. Improving one usually (but not always) degrades the 
other, and good tradeoff methods are needed from the research community. In fact, there 
are many examples where stability, performance, and sensitivity are three competitors in 
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the design process, where design changes that improve one of the three necesarily 
degrades the other two. Modeling and control design theory has not yet provided a 
convenient and practical tradeoff among these three design goals. Part of the problem is 
lack of an agreement about what kind of sensitivity, stability, and performance measures 
to use. For example, in [33] we have the following conclusion 


Theorem 

Let Xj, i=l,2,..,m, denote the distinct eigenvalues of A. Define a measure of root 
sensitivity by 

s =S l, lr 11 . H[# = tr [•]*[•] . 

i=i 9A 


The lowest bound on root sensitivity is S £ n and S = n 1 / and only if A is normal, 
(AA t = A t A). 


Similar results are available for 


11 

a A 
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types of sensitivity [34]. 


For a simple pitch control problem for an aircraft it was shown in [33] that minimal root 
sensitivity in the closed-loop system is achieved (that is, normality is achieved 
(Aj-AJ = AjA c , A c — A + BG) only at values of G which were destabilizing. 
Furthermore, as a design objective a "nearly normal" objective can be added subject to 
performance or stability constraints. In this event the design can depart arbitrarily far 
from minimal root sensitivity (even though the design is "nearly" normal). This is due to 
the fact that the "abnormality" measure a = llA c Aj - AjA c ll can be a convex function of 
the parameters in A c even when the sensitivity measure S is not. This means that "nearly 
normal" does not mean "nearly minimally sensitive". The "nearly normal" design 
objective has been widely used in both frequency and time domain designs. It has 
popular appeal due to the fact that symmetric matrices are normal, easy to work with, and 
have orthogonal eigenvectors. However, such examples in [33] point out that while 
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normal matrices (or orthogonal eigenvectors, or symmetric matrices) coorespond to 
global minimal root sensitivity, a "nearly normal" design goal is suspect since "nearly 
normal" does not coorespond to "nearly insensitive". 

Since root sensitivity is not easily incorporated in designs that minimize other 
types of performance, some authors seek to incorporate different kinds of response 
sensitivity. The max entropy approach of [30] minimizes an L 2 norm of the inputs and 
outputs while modeling the uncertain parameters as zero mean white noise. This leads to 
two Riccati-like plus two Liapunov-like equations to solve by iteration. A case study of 
this approach is presented in [32], by comparing it with a different optimal controller, 
one that minimizes the L 2 norm of nominal performance plus a weighted L 2 norm of 

input and output sensitivity ll-^-II 2 , ii-t^ii 2 where p is the vector of uncertain parameters 

op op 

(not random). These conclusions emerged from these case studies (comparing methods 
[30], [31], and [32]): 

(i) Presently method [31] cannot treat uncertain parameters appearing in the 
measurement matrix, method [30] cannot treat parameters p ; appearing both in B 
and C, while method [32] can do both. 

(ii) Presently method [30] cannot treat parameters p* appearing nonlinearly in (A, B, 
C), while methods [31] and [32] require only differentiability of (A(p), B(p), 
C(p)) with respect to p. 

(iii) When comparing Lj performance of inputs, outputs with stability margins, the 
case studies favored the results of method [32] over methods [30] and [31] in 
most cases studied, but ranked the results of method [32] equivalent to the results 
of method [30] in two of the cases studied. 

Case studies should be encouraged in the areas of robustness and model error 
compensation since applicable theory is quite limited, and case studies can point the way 



VI) The physical plant is always unobservable and uncontrollable (fortunately). 

The impact of each of these principles is discussed to explore limitations of 
available theory. An understanding of these principles can aid in the search for 
successful control designs and for improved "robustness" definitions. 
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A Sensitivity Controller For Uncertain Systems 



Kenji Okada and Robert E. Skelton 


Abstract 

In this paper a new controller design, which we shall call the "Trajectory 
Sensitivity Optimization" method is presented to improve the robustness for parameter 
variations. The method uses the sensitivity trajectory to model the parameter uncertainty 
and introduces a special quadratic cost function involving an input and output sensitivity 
term. 

Necessary conditions are derived to obtain the dynamic controller. The necessary 
conditions consist of two Lyapunov equations and two controller gain equations which 
have no closed form solution. Therefore a special iterative algorithm was developed to 
obtain the numerical solution. 

The method can deal with a wider class of parameter uncertainty than existing 
methods. Numerical examples show that the method is effective in improving the 
robustness to parameter variations. 

1. Introduction 

The LQG theory is well established as a multivariable control design synthesis, but 
it suffers from a poor sensitivity to certain classes of plant parameter uncertainty [1|. 
This sensitivity problem for parameter uncertainty becomes extremely important in 
flexible structure control where there is large parameter uncertainty. To cope with the 
problem various design syntheses have been proposed. 

We are motivated by the trajectory sensitivity approach of Yedavalli and Skelton 
[13) where the necessary conditions are described to solve our problem. By restricting 
our attention to controllers of order n (equal to plant order) we will be able to make 
further progress toward solutions. 
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The sensitivity controller proposed by Wagie and Skelton [2] uses a trajectory 
sensitivity model to include the effects of parameter uncertainty and a special cost 
function involving both an output and input sensitivity term. This paper shows how to 
reduce the sensitivity model to tractable order, while preserving the correlations between 
outputs and all their sensitivities. The main drawback of this sensitivity controller design 
method is that the method does not deal with parameter uncertainty in the measurement 
matrix. 

The Maximum Entropy Method has been applied by Hyland and Bernstein to the 
flexible structure problems [3]. This method uses a stochastic modeling for the 
parameter uncertainty in order to improve the robustness for the parameter variation. 
The design synthesis provides a direct method to the design of robust, reduced order 
controllers in which robust controller design and controller order reduction are performed 
simultaneously. The necessary conditions obtained by this method consist of two 
modified Riccati equations and two modified Lyapunov equations coupled by stochastic 
effects. Two restrictions of the method relate to the structure of the parameter 
uncertainties permitted. The uncertain parameters must appear linearly in the plant, input 
and output matrices. It also requires that the control-and-measurement-depcndent 
uncertain parameters are uncorrelated. Because of this requirement the method cannot be 
applied directly to the problems in which there exists parameter uncertainty that affects 
the control matrix and the measurement matrix simultaneously. The method also cannot 
deal with parameter uncertainties in the disturbance matrix and in the output matrix. 
This may cause the unnecessary degradation of the closed loop system performance. 

The approach developed by Tahk and Speyer [4| is called asymptotic LQG design 
synthesis. This method uses the internal feedback loop to model the parameter variations 
and serves to improve the stability robustness and reduce the sensitivity to parameter 
variation. This approach is a generalization of the LQG/LTR technique introduced by 
Doyle and Stein [31. The approach has difficulties when there exist parameter variations 
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in the input matrix 8 or in the measurement matrix M. In this case the method requires 
augmentation of the state space so that AB and AM are embedded in the state matrix of 
the augmented system. This augmentation of the state space eventually leads to the 
increase of controller order. 

As explained so far, the existing robust controller design methods for parameter 
uncertainty have some restrictions on the structure of parameter uncertainty. Hence, the 
main purpose of this paper is to propose a new robust controller design synthesis which 
can deal with wider classes of parameter uncertainty. The proposed method uses 
trajectory sensitivity to model the parameter uncertainty and introduces the special cost 
function which includes the output and input sensitivity terms in addition to the nominal 
input and output cost. The controller parameters are determined such that the given cost 
function is minimized. Through this minimization procedure, the controller obtains a 
robustness property with respect to parameter variation. The fundamental idea of this 
method is the same as the Wagie, Skelton Method, although the approach to obtaining 
the controller is different. 

This paper is organized as follows. Section 2 discusses the modeling of parameter 
uncertainty using a trajectory sensitivity model. Section 3 introduces the newly 
developed "Trajectory Sensitivity Optimization" method and provides the necessary 
conditions for the sensitivity reducing controller and the algorithm to obtain the solution. 
Section 4 deals with the numerical examples to demonstrate the effectiveness of the 
proposed method and provides performance comparisons with other design methods. 
Finally Section S contains conclusions. 

2. Modeling of Parameter Uncertainty 


2.1 Trajectory Sensitivity Model 
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In this section we derive the trajectory sensitivity model for a simply supported 
beam example which will be used in the numerical example in section 4. 

Assume there arc h uncertain parameters Pi, P 2 . * * * . Ph and a space-state model of 
the system is given by 


x = A(p)x + B(p) u + D(p) w 

(2.1) 

y = C(p)x 

(2.2) 

z = M(p) x + v 

(2.3) 


where x, y, z, u, w, and v are respectively, state vector of dimension n, output vector of 
dimension k, measurement vector of dimension l, input vector of dimension m, zero- 
mean white noise of dimension d with intensity W, and zero-mean white noise of 
dimension / with intensity V, and p is given by p = [p lf pj, • • *, Phl T 
Then the resulting sensitivity system can be expressed as follows: 


^ x g ® A, x, B g u g + D f w g 

y. ^c.x, 

z, = M,x, + v, 


(2.4) 

(2.5) 

( 2 . 6 ) 


where: 



M 

fy 1 


M 

f«l 


fvl 

X =' 

A l 

L x pJ 

• y - = Uj 


W 

' " I u pJ 

v 

II 

•1 

> 

11 

M 



l°J 


A.= 


A 0 
\ 


B.= 

‘b 0 " 

B p B 

fl 

m 

U 

c o' 

.CpC 

It 

Q 

d" 

. D P 

M.= 

'm o’ 

M p M 


[ 7 ] £ block diag { [ * 1, — . t * 1) 

T 


m.4 


£LJL...£Li1I 

3 pi Jph 


where the matrices A g , B„ C,, D„, and M, are evaluated at p = p (nominal vector value of 
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p). The basic idea to improve the robustness for parameter variations is to use a cost 
function V D given by 

V D = E„ [y T Qy + u t Ru + £ (yj&y^ + u^R^)] (2.7) 

i=l 

where y T Qy + u T Ru is the pan of the cost function for the standard LQG design, and 

h T 

LCypiQiypi + UpiRiUpi) are added sensitivity terms. We seek a controller which minimizes 

fcsl 

the cost function V D . Then the sensitivity of the controller to the parameter uncertainty 
Pi is reduced in increasing the norm of weighting matrices Qj and Rj. 

The nominal LQG controller is obtained by setting Qj, Ri to zero, and a controller 
which minimizes sensitivity only (neglecting nominal performance requirements) is 
obtained by setting Q, R to zero. 

22 Trajectory Sensitivity Matrices for An Example 

In order to construct the trajectory sensitivity matrices A„, B„, C_, D d , and M„ for 
the physical system, we deal with an Euler(Bernolntimply supported beam shown in Fig 
1. This example, is used later for sensitivity reducing controller design. In this example 
we take the following three quantities as the uncertain parameters. 

(1) p : Mass Density of Beam (per length) 

(2) El : Rcxual Rigidity of Beam 

(3) K, : Actuator Gain 

It is well known that the natural frequency coj and mode shape *F,(r) for i-th mode 
of a simply supported beam are given by 
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“>=Vf<r )2 ' ' t '.«=V^ !, " ( r r) 

(L: Length of Beam) 

If we assume a torquer at r = r c , a linear displacement measurement sensor at r = r m , and 
a linear displacement at r = r 0 for the output, then we obtain the following equations of 
motion: 


qj + 2C i to i q i + cofo = bj(u + w), (i = 1 n) 

y = H(r 0 .t)=LH / i (r 0 )q i 
i=l 

Z = U(r m , 0 +v= Z'PiO’m) qi + V 

i=l 

where: b, = = •^•4 / i (r) I i=r. = ^ cos(itt/L)r c 

If we choose the state variable x by x = [q T , q T ] T , then the above equations can be 
transformed into the following state-space expression. 
x = Ax + Bu + Dw, y = C x , z=Mx + v 

where 


A = 


0 I 
->2 


Q*diag [co„ • • • , taj scaler 


-2^QJ 

B = [0 • • • 0 <t>i( rc ) • • • d> n (r c )l T - [ 0 Bjl T 
C = L v P l (r 0 ) • • • 0 • • • 0 | — (C y 0J 
D = B 

M = tH* |(r m ) • • • *P„(r m ) 0 • • • 0 J - lM t 01 


Using the above plant model, we obtain the following trajectory sensitivity matrices Apj, 
B pi , Cpj, Dpj, and Mpj for the three uncertain physical parameters. 
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Uncertain Parameter p : p» » — Pt = 1 

(P)nom 



0 o’ 


0 


1 

N,- 

^NOM C^NOM 

• B » = 

”( B u)nOM 

•Cp,= 

— y (Cy)NOM 0 


D Pi = Bp, • Mp, = 


— 2 (My^QM 0 


• Uncertain Parameter El : p 2 = 


El 


(EI)nom 


* P 2 ~ 1 


0 0 
“&NOM “C^NOM 


*p 2 = 

*» = <>• <^-0. ^ = 0 Mp, = 0 


• Uncertain Parameter K. : P 3 * 


K. 


( k *)nom 


. P3=i 


Apj = 0 Bpj = B C^O Dpj = 0 Mpj = 0 


3. Trajectory Sensitivity Optimization Method 

In this section we introduce our controller design synthesis which reduces the 
sensitivity to parameter variations of the plant, using the trajectory sensitivity derived in 
section 2. The basic idea of this method is similar to that of the Wagic and Skelton 
method [2], but the advantages of the new method are: l.) The order of the controller 
(the number of states) obtained by this method is smaller than that of the Wagic and 
Skelton method. 2.)The method can deal with a wider class of parameter uncertainty 
than the Wagie and Skelton method (The Wagie and Skelton method cannot deal with 
the parameter uncertainty related to measurement matrix, i.e., M p term) 
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The method has these advantages over the Maximum Entropy Method: It can deal 
with (a) parameter uncertainty in a nonlinear manner, (b) parameter uncertainty in the 
disturbance matrix D and the output matrix C, and (c) parameter uncertainty appeared 
both in control matrix B and measurement matrix M at the same time. The advantage of 
the Maximum Entropy Method is fewer equations to solve. Some discussion of the 
convergence of the method appears in [12]. 

3.1 Problem Statement 

We consider the following problem. 

For the system 

• n-th order Plant: 


x - A(p)x + B(p) u + D(p) w 

(3.1) 

y = C(p) x 

(3.2) 

z = M(p) X + V 

(3.3) 


where, 

E(w(t)w(x) T ) = W5(t-t) E{v(t)v(t) T ) = V8(t-t) 
E(w(t)J = 0 E ( v(t) } = 0 
p = (p,, • • • , Ph) Uncertain Parameters 

The n-th order Controller is: 


u = G x c 

(3.4) 

x c = A c x c + B c u + F(z- M c x c ) 


= (A c + B C G-F M^Xc + Fz 

(3.5) 


where. 
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A c = A(p) B c = B(p) M c = M(p) 
p : Nominal Value of p 
dF _ A dG _ n ... . 

©Pi dpi 


Find F and G such that the cost function V D is minimized. 



(3.6) 


If we set pi = Poj, where CTj = I Apj I is the magnitude of the expected variations in p it 
then we need only to determine p, Q, and R as design parameters. The weight P is 
usually determined through tradeoff between the robustness to parameter variation 
and the nominal performances of input and output cost the weights Q and R may be 
determined under nominal conditions (<Tj = 0) to satisfy E^yj 2 £ oj 2 , for a specified Oj, 
i=l, 2, ..., n y while minimizing u T Ru. The algorithm for such weights is given in 
chapter 8 of [10]. 


3.2 Derivation of Necessary Conditions 
Let 

* = x - x c (3.7) 

then the equations for the closed loop system are given by 

X = (A-FM)X + {(A+BG-FM) - (A c + B C G-FM C )) x c + Dw - Fv (3.8) 
x c = FMX + (A C + B C G + F(M-M C )) x c + Fv (3.9) 

y - c x + c x c (3.io) 

u=*Gx c (3.11) 

The above equations are transformed into the following matrix forms: 



10 


*«-A t x # + D,w, 
y = c, x, 
u = G,x a 


where : 


x 

A t 


ftl 

‘ W 

r-i 

| 

l X cJ 

1 

l v J 


A.= 


A-FM (A+BG-FMHA C +B C G-FM C ) 
FM A c +B c G+F(M-M c) 


D.= 


D -F 
0 F 


C. = [C C] G, = [0 G] 


Let 


x, = [x T xj xj X^ xj xi) T 

y, = [y T U T yj ujj yj ujf 

w, = [w T v T J T 

then the closed loop sensitivity system is expressed by 

x, = A t x, + D f w, 

y. = c,x. 


where 


(3.12) 

(3.13) 

(3.14) 


(3.15) 

(3.16) 

(3.17) 

(3.18) 

(3.19) 
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Aq 0 

0 

• 0 




A, Aq 

0 

•• 0 



A,= 

: o 

Ao 

• 0 




A h 6 

0 






A(p)-FM(p) 

0 


A<) = A t (p) — 

FM(p) 

A(p)+B(p)G 



9(AJ . 


A pi (p)-FMp i (p) A pi (p)+B pi (p)G-FMp i (p)| 

A i= 

dPi 

FMp^p) 

FM pi (p) 


(3.20) 


( 3 . 21 ) 

( 3 . 22 ) 


(i=l, h) 


' D -F 
0 F 
D p , 0 


C C 0 0 ... 0 0 

0 G 0 0 ... 0 0 

^>i Cpi c C ... 0 0 

0 0 0 G ... 0 0 


(3.23),(3.24) 


Dpi, 0 

. 0 0 . 


• • * * ••• • « 

^ph ^-ph 0 0 ... C C 

0 0 0 0 ... 0 G 


From the triangular structure of A, in Eq.(3.20) note that the poles of the 
closed loop sensitivity system given by Eq.(3. 18) are equal to those of the closed 
loop system without sensitivity states repeated (h+1) times. Therefore if the closed 
loop system is stable at p = p, then the closed loop sensitivity system given by Eq. 
(3.18) is always stable. 

The steady state covariance is defined by 
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X, = E_{x,x, t } = E„ 


XX XXg XXpj XXc^i ... XXph XphXc^ 

M T x c xj x,.xji x c x£ ( ... x c xj h x c x£ 


S. Y. M. 


XphXph XphX^ 
- T Y T 

XcjrfXph X^CpfcJ 


(3.25) 


then X, is obtained as the solution of the following Lyapunov equation: 

A,X, + X,A, t + D,W,D, t = 0 


where 


W,= 


W 0 
0 V 


(3.27) 


using the above X,, we can express the cost function Vq by 

V D = tr[X t C f T Q,C s ] 


(3.28) 


where : Q* = block diag(Q, R, PiQ, PiR PhQ. Ph^) 


(3.29) 


By augmenting the constraints (3.26) to the objective functions (3.28) by use of 
Lagrange multipliers, we introduce H given by 


H = tr [X.CjQ.C,! + tr IK S (A, X, + X, A, T + D, W, D, T )| 


(3.30) 


Then the solution to the problem satisfies the following conditions: 


iH =0 i!i=o M = o M = 0 

dx f 3K, 3G dF 


(3.31) 


Relying on standard matrix calculus, the following necessary conditions are 


derived: 
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•|£- = 0: K,A, + A l T K # + C I T Q l C I = 0 (3.32) 

dX g 

• J£* = 0: A, X, + X g A 7 + D g W, D, T = 0 (3.33) 

oK, 


•t =o: ° 


,jh+ 1 h 

_ — R~ l j gi \r v j. roT 


XK 2k X 2k + IB^K^Xz)} 

k=l k=l 


x[Xj2+ LPk^2k+1.2k+l] 

lc=l 


-1 


(3.34) 


h+l 


• -r— * = 0 : F = — (Kj!— Ki2~ + K 22 ) 1 [( S(^2k~^2k-l)^2k-l}^^ 
oF k=t 


+ S(K 2 k^-Kak + i)(Xi + X2)Mj t ]V- 1 
k=l 


(3.35) 


where K 1? .... K 2 JH .2 and Xj, . . . , X 2 h +2 are defined by 


K,= [K^ Kj ••• K i :nx2n(h+1) (3.36) 

X, = [X, X 2 • * * X t : 2n(h+l)xn (3.37) 


Xy and Kjj are the (i.j) block of X t and K,. 


X.= 


k u 


. X 


t,2h+2 


K,= 


2h+2,l ••• *2h+2.2h+2 
• K l.2h*2 


Xjj : nxn 


K 


l.l 


Ky : nxn 


k 2hf2,l 


k 2hf2.2Ju2 


(3.38) 


(3.39) 


As we can see in the definition of A,, A, contains F and G. Therefore the equations 
Eq.(3.32) - Eq.(3.35) are coupled. However these equations can be solved 
numerically by using the iterative method presented later in this section. Before 
proceeding to the iterative method, we investigate the Lyapunov equations given by 
Eq.(3.32) and Eq.(3.33). 
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The sire of the Lyapunov equations is 2n(l+h)x2n(l+h). Therefore if the 
number of uncertain parameters is big, the equations may be too large to solve 
directly. The structure of the equations, however, allows them to be solved in 
pardoned forms. 


• Partitioning of the Lyapunov equation for X, 
Let 21jj represent the 2nx2n matrix 


X 2i-1.2j-l X 2i-1.2j 
X 2tfj-1 X 2i.2j . 


and Dy be the 2nx2n matrices given by 


D 



dwd t +fvf t -fvf t 
-fvf t fvf t 


(3.40) 


(3.41) 




0 


0 

0 


(3.42) 


(i £ l j 2 2 and j £ i, D p> - D) 

then Zjj is the 2nx2n block of X g in the ij position, and Dy is the 2nx2n block of 
D § W s d 7 in the ij position. The partitioned matrices Zy (j ^ i ^ 1) arc obtained by 
solving the following equations: 


(1 , 1) block: AqZj j + Zj | Aq 1 + Dji = 0 (3.43a) 

(l,j) block: AqZjj + Z|jAj" + Zjj AjEj + Djj = 0 (3.43b) 

0-2 h+1) 

(i , j) block : AgZjj + Z^Aq" + (Aj_|Z|j + ZjjA^Ej) + Djj = 0 


(2siuj£h+l) 


k 
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Since the (1,1) block element given by Eq.(3.43a) is the standard Lyapunov 
equation, it may be solved directly for Z u . Substitution of Z u obtained in 
Eq.(3.43a) into the (1 j) block elements reduces Eq.(3.43b) to standard Lyapunov 
equations. In a similar way the (ij) block (2 £ i £ j £ h+1) elements can be solved by 
substituting Z u and Zy obtained in the previous calculations. 

Therefore the Lyapunov equation for X, of order 2n(l+h) can be reduced to 
the (h+l)(h+2)/2 Lyapunov equations of order 2n. It can also be shown easily that 
each partitioned Lyapunov equation obtained above can be partitioned further into 
four sub-block elements of order n which can be solved separately in the sequence of 
(1,1), (1,2), (2,1), (2,2) sub-block. Hence the total number of Lyapunov equations of 
order n becomes 2(h+l)(h+2). However, from the symmetric property of X, the 
diagonal block elements Zy (i=l,...,h+l) are reduced to three sub-block elements of 
order n instead of four sub-block elements. Therefore the total number of Lyapunov 
equations to be solved is (h+l)(2h+3). 

• Partitioning of the Lyapunov equation for K, 

Let Yjj represent the 2nx2n matrix 

^ K 2i-1.2H K 2i-l,2j 
Y,i ~ . K 2i.2j-1 K 2i.2j . 


(3.44) 


and Cy be the 2nx2n matrices given by 
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C i 
Ml “ 

C t QC + £ CjQkCp, C t QC + 2 

fc=l k=l 

C t QC+ JClQiCp, C t QC+ 2CiQ k C Pl + G T RG 

k=l Vc=l 

c u~ 

'<£,9 !h<= cJ. 1 Q h C 
CpnQr.C <£,9-i C 

(2SjSh+l) 

Qi- 

C t Qj_ 1 C C T Qi_iC 

C T Qi_,C C t Q w C+G t R w G 

(2SiSh+l) 

C-= i 
'“u 

0 (2 £ i < j) 




(3.45) 

(3.46) 

(3.47) 

(3.48) 


where Q R = P k Q , R k = f^R , then Yy is the 2nx2n block of K, in the ij position, and 
Cy is the 2nx2n block of CjQ/2 % in the ij position. The partitioned matrices Yy 
(j 2. i £ 1) are obtained by solving the following equations: 


(j,j) block: 

YyA 0 + Aq' Y y + Cy = 0 , (KjSh+l) 

(3.49a) 

(i,j) block : 

YyA 0 + AjVy + Cy = 0 , (l<i<j) 

(3.49b) 

(l.j) block: 

Y ijA 0 + Aq*Y ij + AjIiYjj + C t j = 0 , (2 < j) 

(3.49c) 

(1,1) block: 

Y, iA 0 + A 0 t Y, | + (Y l2 A t + Y l3 A 2 + • • • + Y l<h+ , A h ) 


+ (Y l2 A, + Y 13 A 2 + • • • + Y| .h+iAh) 1 ^ + C u — 0 . 

(3.49d) 


From Eq.(3.49) and Cy = 0 (2 £ i < j). we obtain 

Yy = 0 for 1 < i < j (3.50) 


Therefore the structure of K t is given by 
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K,= 


Y„ Y 12 Y 13 
PiY 0 0 
P2 Y o 

S. Y. M. 


>• Y lih+1 | 
.. 0 

.. 0 
PhY 0 


(3.51) 


where Y 0 is the solution for the following Lyapunov equations: 


YoAq + aJY 0 + C q = 0 



C t QC C t QC 
C t QC C t QC + G t RG 


(3.52) 

(3.53) 


The number of Lyapunov equations to be solved becomes h+2 Lyapunov equations 
of order 2n or 4h+6 Lyapunov equations of order n. When we choose 
s Pj = • • • = (3), = 0, only the (1,1) block K, becomes non-zero. In this case it can 
be easily shown that the equations for G and F reduce to the standard LQG 
equations. 


3.3 Algorithm to obtain the Numerical Solution 

Since the equations obtained as the necessary conditions are coupled, a special 
numerical algorithm is required. The approach taken is similar to that of the Wagie 
and Skelton method. The algorithm is summarized as followed: 

• Algorithm to obtain F and G 


STEP 1 Choose initial F and G 

F = F 0 G = G 0 


(e.g. use the solution for the standard LQG problem) 
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STEP 2 For the given F and G, solve X t and K, from Eq.(3.32) and Eq.(3.33) using 
the partitioned Lyapunov equations. 

STEP 3 Using X, and K, obtained in STEP 2, Calculate F and G. Set Fne W = F 
Gnew = < J 

STEP 4 If I Gnew ~ Gold * < E j ' Fnew ” Fold * < 
then Stop : Solution Completed. Otherwise set 

F = Fnew® + Fold G ■ <*). 

G = G[siEw a + Gold G * a )» 

Return to STEP 2. 

where a is the coefficient which dictates the convergence of the solution. Usually 
0.2 - 0.5 is used as the value of a. But when the weight ft to the sensitivity part of 
the cost is large, a smaller value may be necessary. 


4. Numerical Example and PerformanceComparison with other Controller Design Methods 
In order to investigate the effectiveness of the proposed controller design 
synthesis and compare the performances with other methods, we take the following 
three examples. 

4.1 Simply Supported Beam 

We have already derived the sensitivity trajectory model for a simply 
supported beam in section 2. Here we consider the same example. As numerical 
values we use the following values: 


• Beam Parameter : L » it p 
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• Sensor, Actuator, Output Position : r m = 0.30 L r c = 0 r 0 = 0.45 L 

• Noises : V = 1 W = 1 
Then co,, v P i (r), <D,(r) are given by 

ti )5 = i 2 'Fj(r) = sin (i~r) ^(r) = i cos (i-^-r) 

(i = 1, • • • ,n) 

If we choose the first 4 modes as our design and evaluation model, then we obtain 

Q = diag(l 2 ,2 2 ,3 2 ,4 2 ) B u = [1 2 3 4 ] T 

Cy = [sin(0.45 rc) sin(2x0.45 k ) sin(3x0.45jt) sin(4x0.45 Jt)] 

M r = [sin(0.30rc) sin(2x0.307t) sin(3x0.30rc) sin(4x0.307i)] 

Substituting these matrices into A, B, C, D, M and A^, B^, C^, D pi , M P( , we obtain 
the required data for the Trajectory Sensitivity Optimization (TSO) method design 
synthesis. 

If we choose Q = 1, R = l for the weights of the cost function and apply the 
standard LQG method, then we obtain the controller whose stability range for 
parameter variation and input & output cost are summarized in Table 1. As we can 
see from Table 1, the standard LQG controller is sensitive to the parameter variation 
p and El. 

We apply the TSO to the same system to reduce the sensitivity. First we 
investigate one uncertain parameter case in which only one uncertain parameter is 
considered for the design of sensitivity reducing controller. Next we deal with two 
uncertain parameter case in which two uncertain parameters are considered at the 
same time for the controller design. 
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• Sensivity Reducing Controllers for One Uncertain Parameter Case 
Controller Type A 

In this case the sensitivity part with respect to mass density variation is 
weighted for the TSO cost. Table 2 shows the stability range for parameter 
variation and input & output cost of Type A controllers for different weights 
p. When we compare Table 1 and Table 2, we see that the stability range for 
p variation increases as the weight P increases. In this case the sensitivity to 
El variation is also reduced. 

The gain margin, however, decreases as the weight P increases. Fig 2 
shows output cost performance change due to p variation for different 
controllers. As we see in Fig. 2 and Table 2, the output cost increase rate is 
maintained relatively small while the input cost increases pretty rapidly as P 
increases. Therefore tradeoff between the robustness to parameter variation 
and the input & output cost should be made to determine the appropriate 
weight p. 

• Sensivity Reducing Controllers for Two Uncertain Parameter Cased 
Controller Type B 

In this case two sensitivity terms (i.e, sensitivity terms with respect to p 
and KJ are weighted at the same time for the TSO cost. Table 3 shows the 
performance of Type B controllers Comparing the results with the standard 
LQG controller performance, we notice that controller B-2 can achieve better 
robustness for parameter variations p, El and K, also. 

4.2 Cart with an Inverted Pendulum 

Next we consider the cart with an inverted pendulum shown in Fig 3. The 
linearized equations expressed in state-variables form arc given by 
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dt 




0 1 

0 
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• % 


0 ' 

z 

z 
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0 
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0 0 
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Ml 8 





Mi. 


u 


or 


x = Ax + B(u+w) 


y=z=[1000]x=Cx 
Zm : Zm = 


J>1 


m 


10 0 0 

l*J 

' + ' 

w 


0 0 10 


x + v = Mx + v 


We consider the following two uncertain parameters: 
a) Actuator Gain (K,) : 


We obtain the following sensitivity data. 
Trajectory Sensitivity Data : p = 


K. 


( k *)nom 

Ap = 0 B p = B Cp = 0 D p = 0 M p = 0 
Maximum Entropy Design Data : 

A! =0 B; =0 M, =0 
b) Angle Sensor Gain (K„) : 


We obtain the following sensitivity data. 
Trajectory Sensitivity Data : p = 


K, 


( k $)nom 


Ap = 0 B p = 0 Cp = 0 D p = 0 M p = 


0 0 0 0 
0 0 10 


Maximum Entropy Design Data : 
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A, = 0 Bj = 0 M,= 


0 0 0 0 
0 0 10 




Numerical Values : M = 1 kg 
W» 1.0x10"* V = 1.0x10“* 


m = 0. 1 kg 

fiol 


L = 1 .0 m 


1 Q = 1 


g 


9.8m/sec 2 


• Sensitivity Reducing Controllers for K, Variation 

Three different sensitivity reducing controller design synthesis, 1) TSO 
2) Wagie and Skelton method 3) Maximum Entropy method, were applied to 
this problem to compare the performance. The performance curves (input & 
output cost versus stability range of K t variation) for different controller 
designs are shown in Fig 4. As we can see in the figure, the Wagie and 
Skelton method cannot improve the robustness for K, variation even if a large 
P is chosen. The TSO achieves smaller input & output cost than the 
Maximum Entropy method for the same stability margin. Therefore the TSO 
is best for this problem in terms of performance cost and robustness to 
parameter variation. 

• Sensitivity Reducing Controllers for K, Variation 

The Wagie and Skelton method cannot deal with the problem in which 
measurement sensitivity matrix M p is non-zero. Therefore the TSO and the 
Maximum Entropy methods were applied to this problem. The results are 
shown in Fig 5. In this case the TSO achieves smaller output cost than the 
Maximum Entropy method for the same stability margin, whereas it requires 
larger input cost than the Maximum Entropy method. Fig 5 suggests that in 
this case there is no big difference between two methods with respect to 
performance cost and stability margin for K t variation. 
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4.3 Doyle's Example 

The problem considered here was first given by Doyle [1] and investigated 
further by Bernstein in his Maximum Entropy method [7]. The required data for the 
problem are given by Plant Matrices : 



1 1 


0 


1 

A = 

0 1 

B = 

b 

C = [l 1] D = 

1 


Uncertain parameter : b in matrix B (b)^oM = 1 


Like the previous example, three different controller design syntheses were applied 
to this problem. The performance curves for different controller designs are shown 
in Fig 6. In this case the controllers obtained by the Wagie and Skelton method 
show worse robustness for parameter variation b than the standard LQG controller. 
Therefore the method is unacceptable for this problem. The TSO and the Maximum 
Entropy method (For the Maximum Entropy method, we used the results presented 
in [7]) show the similar results to those obtained in the previous example (Uncertain 
Parameter : K, Fig 5). 


5. Conclusions 


In this paper a new controller design synthesis is presented to improve 
robustness to parameter uncertainty. The proposed method uses the trajectory 
sensitivity to model the parameter uncertainty and introduces a special cost function 
to reduce the parameter sensitivity at both the input and output to the plant. The 
order of the controller is equal to that of the nominal plant. The necessary 
conditions for the optimization consist of two Lyapunov equations and two gain 
matrix equations. An iterative algorithm was developed to obtain the solution to 
these coupled equations. The large sizer of the Lyapunov equations of order 2n(h+l) 
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is reduced to several smaller equations, using partitioned forms improving numerical 
efficiency. 

The new method can deal with a wider class of parameter uncertainty than 
either the Wagie and Skelton method or the Maximum Entropy method. The new 
method can deal with parameters appearing nonlinearly in any place; the plant 
matrix, the input matrix, the disturbance matrix, the output matrix, and the 
measurement matrix. 

Numerical examples show that the method is effective in improving 
robustness to parameter variations. For examples with a simple parameter structure 
other methods can be applied (Maximum Entropy and Wagie, Skelton). In some of 
these examples the new method performed as well as these existing methods. In 
other examples the new method performed better. The disadvantage of the method 
is the lack of a closed form solution. Iterative algorithms are required whose 
convergence remains an open question for future research. 
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Tabic 1 Performances of the Standard LQG Controller 


Standard LQG Controller 

Performances 

Gain Margin 

K«« 

4.087 

Stability Range of p 
P = P/(P)nom 

^max 

1.173 

P . 

‘ min 

0.677 

Stability Range of El 

P = EI/(EI) NO m 

p 

‘max 

3.226 

p . 

‘nun 

0.863 

Input & Output Cost 

v y 

2.760 

v u 

0.488 


Table 2 Performances of Controllers Type A 


Sensitivity Reducing 
Controller for p 
Type A 

Performances 

Type A-i 

p = o.oi 

Type A-2 

p = o.i 

Type A-3 
P = 0.5 

Gain Margin 


3.247 

2.300 

1.706 

Stability Range p 
P = P/(P)nom 

^max 

1.415 

1.687 

1.802 

p . 

‘min 

0.681 

0.596 

0.435 

Stability Range of El 

P = EI/(EI) NOM 

p 

*max 

2.774 

2.704 

2.734 

p . 

* mm 

0.733 

0.631 

0.607 

Input & Output Cost 


2.856 

3.147 | 

3.821 

V u 

1.300 

3.709 

11.243 


Table 3 Performances of Controllers Type B 


Sensitivity Reducing 
Controllers for p and K, 
Type B 

Performances 

Type A-2 TypcB-1 Type B-2 

P, =0.1 P, =0.1 p,=o.i 

P 2 = () P 2 =l p 2 =l() 

Gain Margin 


2.3(X) 

3.054 

4.843 

Stability Range p 
P = P/(p)nom 


1.687 

0.596 

1.567 

0.661 

1.360 

0.687 

P 

1 min 

Stability Range of El 
P = EI/(EI) N om 

p 

1 mux 

2.704 

0.631 

1.618 

0.669 

1.526 

0.755 

p 

1 mm 

Input & Output Cost 


3.147 

3.709 

2.913 

3.692 

2.723 

3.865 

v u 





Actuator 
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Figure 2 Output Cost Performance Change due to 
p Variation for Different Controllers 
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Figure 4 Performance Comparison of Different Controller 
Designs (Uncertain Parameter: K ) 
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Abstract 

The existing q-Markov COVER realization theory does not take into account 
the problems of arithmetic errors due to both the quantization of states and 
coefficients of the reduced order model. All q-Markov COVERS allow some 
freedom in the choice of parameters. In this paper we exploit this freedom in the 
existing theory to optimize the models with respect to these finite wordlength 
effects. 
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Introduction 

An asymptotically stable system can be characterized in terms of its impulse 
response sequence (Markov parameters) and its output covariance sequence 
(covariance parameters) due to a zero mean white noise input process. A general 
approach has been developed [3] for realizing a system which matches q Markov 
parameters and q covariance parameters. Such a system is referred to as a q- 
Markov COVER, and q-Markov COVERS may be generated from output data 
[3,4] or from higher order models [5,6]. The Markov and covariance parameters 
are not independent and consequendy the q-Markov COVER is not unique. In 
particular, all q-Markov COVERS are not related by state space similarity 
transformations [4]. In this paper we shall exploit the remaining degrees of free- 
dom to optimize the q-Markov COVER realization with respect to an aspect of its 
finite wordlength realization. 

Specifically, when digital controllers are to be implemented, both the con- 
troller coefficients and the controller states must be represented in finite 
wordlength precision. This finite wordlength (FWL) representation (or quantiza- 
tion) causes inaccuracies in the response when compared to the ideal (i.e. infinite 
precision) behaviour. Effects Of quantization on the controller are increased noise 
at the output due to internal state quantization, and errors in time and frequency 
response characteristics due to coefficient errors. 

In digital filter design, the FWL effects are known to be most significant 
when the poles of the filter are very close to the unit circle [12]. In particular, 
narrow band filters have all these poles near z = l±jo. For digital control, the 
zero-order-hold equivalent of a continuous time model (or controller) with a pole 
at X will have a discrete pole at exp (XT). Hence for fast sampling and/or low 
damping of the continuous models, the discrete model will behave like a narrow 
band filter. The synthesis of optimal digital controllers with respect to 
arithemetic quantization noise is an important consideration in design especially 
for continuous time systems operating under a fast sampling rate [9,10]. The 
effects of quantization depend highly on the structure of the controller. This 
paper seeks to reduce these errors in the synthesis of q-Markov COVERS. 

1. Discrete q-Markov COVER 

Consider the asymptotically stable nominal discrete system 
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x(k+l) = Ax(k) + Bu(k) ; xOOeR"*, u(k)eR n ' 
y(k) = Cx(k) ; y(k)eR nr vU) 

where (u(k)} is a zero mean process with unit intensity E {u(k)u (j)} = 18^ and 
E (x(k)u*(j)J = 0 for k £ j. The Markov parameters Mj and covariance parameters 
Rj of (1.1) are defined by 

Mj = CA‘B ; Rj = CA^XC*, j 2: 0, Rj = CXA*jC*, j £ 0 (1.2) 

where the state covariance matrix X satisfies the Lyapunov Equation 

X = AXA* + BB* . (1.3) 


These parameters Mj and Rj appear as coefficients in the expansion of the transfer 
function H(z) and power spectral density H(z)H*(z -1 ); that is 


H(z) = C(zI-A) -1 B = XMiZ-^ 1 *; H(z)H V 1 ) = £ RjZ _j 

i=0 j =— 

We suppose that as data we are given the first q-Markov and first q-covariance 
parameters {Mi, R^ i = 0, 1, q-1 } of an asymptotically stable system from 

which we construct die two data matrices 


£> q ^R q - 

ER W 

where R q , M q and M q are the Toeplitz matrices of the data as defined by 

Ro Ri ... Rq-l 

Ri Ro ... Rq-2 



Rq-1 Rq-2 ... Ro 



0 0 ... 0 0 


Mq 0 ... 0‘ 


Mo 0 ... 0 0 


Mi Mq ... 0 

M q = 

Ml Mq 

• • • 

• ♦ • 

’*1 

•O 

II 

• • • • 

• • * • 

• • • # 

Mq_ 2 0 


Mq_ 2 Mq_ 3 ... Mq 0 


Mq.! Mq_ 2 ... Mq 


(1.4a) 


(1.4b) 


The first data matrix D q in (1.4a) is Hermitian and it is shown in [3-4] to be 
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positive semidefinite. Hence we can obtain a (nonunique) full rank factorization 


D q = PqP q *; PqER^, 

(1.5a) 

where 


r q = rank (D q ) = rank(P q ) £ iyj 

(1.5b) 

If we partition P q according to 


Pq = [E q F q ]; E q eR w , F q eR (q_1 ^ 

(1.6) 

then it follows that the second data matrix D q can be factored as 


D q = P q P q ; PqER^ 

(1.7) 

where 


Pq = [F q Gq); G qE R^ 

(1.8) 


for some G q (to be determined). The following result has been established. 
Theorem 1.1 [3] 

Given the q Maikov parameters {M*; i = 0,1, ..., q— 1) and the q covariance 

parameters {R i ;i = 0,l, q— 1 } and a matrix G in (1.8) such that (1.7) is 

satisfied, then the realization { Aq, B q , C q } of order r q defined by 

A q = P q + P q ; B q = P q + [Mo* ••• M^J*; C q = E q (1.9) 

where P q denotes the Moore-Penrose inverse of P is a q-Markov COVER. The 
corresponding controllability grammian Xq is given by 

X q = I (1.10) 

Furthermore 

Pq = [Cq A q C q * * * (Aq - 1 )*C q ]* (1.11) 


nm 

This theorem describes a large but not complete class C q of q-Markov COVERS 
parameterized by {G q } such that for some E qt F q the data matrices D q ,D q satisfy 
(1.5)-(1.8). Each matrix G q will (generally) result in a q-Markov COVER having 
a different transfer function. In order to compute the set of all such G q , observe 
in (1.5)-(1.8) that 
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D q = 


(E q * Fq) 


(1.12a) 


Then 


D q = 


Dq-l ^ 


XI 

.dq‘ dqq 


[gJ 


[F q * Gq] 


(1.12b) 


dq qC/?”^ 


implies 


EqEq = R* , FqFq = D^, F q G q * = dq, G q G q * = d, q (1.13) 


Now expand D q in terms of its singular value decomposition 


Dq^Uz) 


Si 0 
0 0 


u; 

U 2 * 




Then from (1.12a) 


(EqFq^^SW 


(1.14) 


(1.15) 


so that E q = C q is defined by the first ny rows and F q by the last (q-r)ny rows of 

UiSi 172 . Define 


p q = rank (F q ) 


Then from (1.15) 


(1.16a) 

(1.16b) 


p q £ min (r q , (q-l)ny) . 

Next, expand Fq in (1.13) in terms of its singular value decomposition. If strict 
inequality occurs in (1.16b) we have 


Fq=[U 0 Uf>] 


2*° 
0 0 


; 2q6R w ’ 


The Moore-Penrose inverse Fq of F q is then given by ' 


(U7) 


(U8) 


Corollary 1.1 
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Define 


(i) Gqj^CF+dq)* (1.19) 

(ii) G q2 G R such that G q2 G^ = d^-dq’D^dq 


where 

s q = rank [dq q - djD dq] 

(1.20) 

and 

(iii) G q3 = VpG/? (r '" p,)xr ’. 

(121) 

Then if strict inequality occurs in (1.16b) the set of all G q which satisfy (1.13) are 

defined by 

Gq = Gqi+Gq 2 UqGq 3 

(1.22a) 

where 

UqfiR^; Sq^Tq-Pq^ny 

(1.22b) 

is an arbitrary row unitary matrix (i.e. U q U q = I). Furthermore, if the Moore- 

Penrose P q of 

P q =IEq* F q T 

(12 3) 

is expressed as 

Pq + = CLn Lid; L n G*^>\ L 12 g RW 

(1.24) 


then the corresponding state space representation {A q , B q , C q ) of the q-Markov 
COVER is given by 


Aq-Ln +L 12 G q ; Ln = L n F q e 

B q = P q + [Mo*Mi • • • M^j]* ; Cq = Eq . (1.25) 

If r q = Pq, then G q = G ql is unique. 

Proof: The expression forF q G q in (1.13) implies G q is of the form 

Gq = F q + dq + G* 3 M*; M g 

for some M. Then expanding G q G q using (1.13) we have 
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dq, = d^(F^)*Fqdq + dq*(F£*G^M* + MG^F+dq + MG q3 G^M 
Also from (1.13) and (1.21) 

(F q + )*F q + = D^ 1 , G^G *) = I ; (Fj)*G<J,*=0 

so that 


MM* = dq q - d^(F q )*F q dq 

Since MM* has rank s q , 

s q = rank(G q2 G q2 ) £ r q - p q 


(1.26) 


2. Optimal Finite Wordlength q-Markov COVER 

A fixed point finite wordlength realization of the ideal (i.e. infinite precision) 
q-Markov COVER (1.1) shall be referred to as a q-FWL Markov COVER and is 
described by 

x(k+l) = AQ[x(k)] + Bu(k) 

?(k) = eQ[*(k)] (2.1) 

Q[*(k)] = x(k)-e(k) 

The components of the matrices A, B, C are assumed to have a W 0 bit fractional 
representation obtained by quantization of the components of A, B, C in (1.1). 
The components of 2(k) have a W+W 0 bit fractional part while components of 
Q[x(k)] and Q(k) all have a W bit fractional part The components of the state 
residue vector e(k) has a W+W 0 bit fractional representation in which the most 
significant W bits are zero. The LHS and RHS of (2.1) are therefore consistent 
with respect to their fractional wordlength representation. The number of bits 

A A A 

required to represent the integer parts of A, B and C depend on the dynamic range 
of the coefficients. State space structures in which all coefficients are less than 
unity are therefore advantageous in this regard. The required integer representa- 
tion of Q[x(k)] will depend on the dynamic range of the input signal u(k). Inade- 
quate dynamic range will result in arithmetic overflow. The accuracy in the com- 
putation of x(k) is determined by its fractional wordlength W. 

Define the state error vector e^k) and output error vector Ey(k) by 
e*(k) = x(k) - x(k); £y(k) = y(k) - y(k) 

Then from (1.1), (2.1) and (2.2) 


( 2 . 2 ) 
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ExOc+l) = Ae*(k) + Ae(k) + AAQ[x(k)] + ABu(k) + BAu(k) (2.3) 
EyOc) = C^Oc) + Ce(k) + ACQ[x(k)] 

where 

AA = A - A; AB = B - B; AC = C - C 
Au(k) = fl(k) -u(k) 

There are five terms which contribute to the output error (i) internal arithmetic 
errors e(k) due to state quantization (ii) coefficient errors due to errors AA in A 
(iii) AB in B (iv) AC in C, and (v) input quantization errors Au(k). Under weak 
’sufficiently exciting’ conditions on the input (u(k)} it can be shown [6] that if 
Q[‘] in (2.1) denotes ’roundoff quantization, then (e(k)) is a zero mean uniform 
white process with covariance 

E (e(k)e*(k)} =y 2 I; y 2 =-^- 2' 2W . (2.4) 

Similarly (Au(k)} is assumed to be a zero mean white uniform process with 

£(Au(k)A*u(k)}=y 2 I (2.5) 

We assume that the quantized coefficients A, B, C are obtained by rounding A, B, 
C to W 0 bit fractions. Consequently, all components Ap of the error matrices AA, 
AB, AC satisfy 

lApI <y 0 ; Yo = “2- W ”. (2.6) 

For simplicity we normalize the error matrices and define 5A, SB, SC by 

5A = — AA; SB = — AB; 5C = — AC (2.7) 

Yo Yo Yo 

so that all components 5p of SA, SB and SC satisfy 

I Sp 1 < 1 . (2.8) 

The steady state output error covariance Y of {£ y (k)} is then given by 

Y = CPC* + ^CC* + Yo (SC) (X-Ky 2 !) (SC)* - Y 0 Y Z [C(SC)* + (5C)C*] , (2.9) 


where 


P = £{e x (k)e x *(k)} 
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* APA* + y*AA* + Yo(5A)(X+y 2 I)(8A)* + y 2 (5B)(8B)* + ^BB* 
and 

X = E {*(k)**(k)} = AX(A)* + y*A(A)* + (1+T^)BB* 

For the remainder of this section we assume no coefficient errors (i.e. y 0 = 0 in 
(2.9)) and consider only the effects due to finite state wordlength (FSWL). The 
issue of coefficient error shall be resumed in Section S. 

Theorem 2.1 

Define the output noise measure 
Then for Y o = 0 

J = Y 2 {tr[K] + tr[B*KB] } 

where 

K = A*KA + C*C . 


( 2 . 10 ) 

( 2 . 11 ) 


Proof: From (2.9) 

Y = CPC*; P = APA* + 7 2 Z = P + fl 

where 

Z = I + BB*; 

Now 


and 


P = Y* £ A k Z(A k )* 

fc =0 


so that 


K= £ (A k )*C*CA k 

lc=0 

tr[CPC*] = Y 2 tr(ZK). 


□m 
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A fixed point q-FSWL Markov COVER corresponding to the (ideal) q- 
Maikov COVER (1.1) is therefore described by 

*(k+l) = AQ[x(k)] + Bu(k) 

y(k) = CQ(x(k)] (2.12) 

Q[x(k)] = *(k)-e(k) 

The output noise gain (q x ) due to state quantization and the output noise gain 
Cry due to input quantization are defined by 

T| x ^ tr[K]; T| u ^ tr[B*KB) (2.13) 

The noise gain t\ x generally varies with state space representation whereas t] u is 
independent of the coordinate basis. Specifically, consider the q-FSWL Markov 
COVER 

z(k+l) = A Q[z(k)] + B u(k) 

y(k) = CQ[z(k)] (2.14a) 

Q[z(k)] = z(k)-f(k) 

where 

A = T^AT, B = T^B, C =CT (2.14b) 

and Q[z(k)] has a W bit fractional representation. Assuming 'sufficient excita- 
tion’ by u(k), the state residue sequence {f(k)J in (2.14a) due to roundoff quanti- 
zation will again be a zero mean white uniform process with covariance as in 
(2.5). The corresponding output quantization noise gains T| z and Tj u due respec- 
tively to state and input quantization are given by 

ti z = tr[KJ; ^ u = tr[fiXB] (2.15) 

where B is given by (2.14b) and 

K Z = AK ] A* + C*C . (2.16) 

But from (2.11), Kj = T*KT, so that 

ti z = tr[T*KTl; ii u = tr[B‘KB] (2.17) 

Notice from (2.13) that the noise gain ri u due to input quantizadon errors is unaf- 
fected by a similarity transformation. Conversely the noise gain T| x due to state 
quantization generally changes with co-ordinate bases. There is no change if T is 
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unitary. The q-FSWL Markov COVER (2.14) is superior to the q-FSWL Markov 
COVER (2.12) if 

(2.18) 

However the comparison in (2.18) must be made under the assumption of identi- 
cal scaling of the states &(k) and 2(k). Specifically, equal ^-scaling of gain a 
from a zero mean unit intensity white noise input fl(k) to the state components 
$j(k) of x(k) requires 

Xjjsaforallj (2.19) 

where Xjj denotes die jth diagonal component of the state covariance matrix X 
given by (1.3). Equal ^-scaling of gain a of components of z(k) in (2.14) 
requires 

Z^a; Z=AZA*+BB* (2.20) 

Equality in ^-scaling of representations (2.12) and (2.14) is equivalent to equality 
in the state dynamic range (i.e. number of bits in the integer representation of 
states) for a given probability of overflow. We now state a result which is impor- 
tant for establishing l 2 -scaling. 


Lemma 2.1 [8,9] Suppose M = M* > 0 is an nxn matrix. Then a necessary and 
sufficient condition for the existence of ^unitary matrix V such that 

(yM V^= a for all j 

is ^ 

tr[M] = na 


nm 

We have shown in Lemma 1.1 that different similarity transformations of an 
ideal q-Markov COVER corresponds to different factorization of the first data 
matrix D q in (1.5a). Our aim is to optimize this factorization. 

Definition 2.1 

The Optimal q-FSWL Markov COVER minimizes the output quantization 
noise gain T) due to state quantization errors; that is 
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ti opt =mintr|T*K q Tl; T*T = A* 1 
^ T.G, H 

subject to the l 2 -scaling constraint: 

Ajj = a for all j 

where the observability grammian Kq satisfies 

K q =A q *K q A q +C q *C q 
with {Aq, B q , Cq) defined by (1.22)-(1.25). 


( 2 . 21 ) 

( 2 . 22 ) 

(2.23) 


In corollary 1.1 we have shown that all the degrees of freedom available to 
select G q are confined to an arbitrary row unitary matrix U q . We now show how 
to optimize U q . 

Theorem 2.1 

a. The optimal q-FSWL Markov COVER (1.21a), (1.24) is defined by 

Ti 0 ^=r“ 1 min(tr[Kq V4 ]) 2 (2.24) 

where U q e ^ ^ arbitrary row unitary matrix and Kq satisfies 

(2.23). 

b. The transfer function of the optimal q-FSWL Markov COVER has Hankel 
singular values given by the eigenvalues of Kq defined by the minimizing 

U T 

c. Suppose Uq = Uq„ is die minimizing solution corresponding to the optimal 
Gq-Gqo in (1.21). Let {Aq,,, Bq,,, Cq,,} be the corresponding state space 
realization in (1.24). Then the optimal q-FSWL Markov COVER has a 
(nonunique) state space representation {T^AqoT,,, T^ l B qo , C qo Ty where 

T 0 = U 0 7t 0 V* (2.25) 

such that 

(i) the unitary matrix U 0 is defined by 

U 0 *Kq 0 U 0 = Io 2 (2.26a) 


where 
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Kqo “ AqoKqoAqo + Cqo^qo » ^ diag ( CT^q, ^20» •••* ®r^ol (2.26b) 
in which {o^} are the optimal Hankel singular values (eigenvalues of Kq 0 ). 

(ii) 


*. 2 =-5-(£<wz.-* 

a% tei 

and (iii) V 0 is unitary such that 

*< 

(V.2.V 0 ’) Jj = J=|— for all j 
r q 


opt - n q (Optimal) = -|— ( 2 <?ko) 2 

CtTr q lc=l 


(2.27) 


(2.28) 

(2.29) 


Proof: By corollary 1.1 we have for G q defined by (1.22) for any row unitary 
matrix U q (of appropriately specified dimensions) that G q defines a q-Markov 
COVER. The corresponding realization { A q 3 q .C q ) for each such U q has identity 
controllability grammian and observability grammian Kq defined by (2.23). Now 
given a particular U q , apply a similarity transformation 

T = U 0 Jt 0 V 0 * 

to the given q-Markov COVER. Then 

trCr*K q T) = tr(jt 0 2 U 0 *K q U 0 ) 
and 

(T*T) _1 = V 0 7Co 2 V 0 * 

By lemma 2.1, the l 2 -scaling constant can be satisfied for some V 0 provided 
tr(7t“ 2 ) = no. Following Williamson [1, Theorem 4.1] (with a minor modification 
of the l 2 -scaling constraint), the optimal performance is given by 

(E<Jj > 2 
Vpt ~ ^ 

where {a 2 } arc the eigenvalues of Kq. That is. 
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traC*) = £<Xj 

j=i 

The optimal q-FSWL Markov COVER therefore achieves the minimum in (2.24). 
The structure of U 0 , jc 0 , V 0 in (2.25)-(2.29) follow directly from Williamson [1] 
(see proof of Theorem 4.1 with U = I). 

3. Computation of the Optimal FSWL Markov COVER 

Necessary conditions for the optimal solution in Theorem 2.1 can be 
obtained using the method of Lagrange multipliers. Specifically, let 

J = (tr[K *]) 2 + tr[A(-Kq+Aq*KqA q + Cq*C q )] + tr[Q(I— U q Uq)] (3.1a) 

where 

A = A* € R t * x + Cl = O* e R ***' (3.1b) 

are symmetric Lagrange multipliers. After taking derivatives of J using (122) 
and (1.25) 

=K q“ AqKqAq - C q C q 


■Sz = 2I + 2AK* + 2A q *AA q K* (3.2) 

dKq 

= 2G^r 2W Gi - 2QU, 

By setting these derivatives to zero we obtain the following result. 

Lemma 3.1 Necessary conditions for the derivation of the optimal q-FSWL Mar- 
kov COVER are given by 

Kq = AqKqA q + C q C q 
-A = A q AA q + ; A = A 

UqUq = I ; Uq € R (3.3) 


c* a 
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nu q -p q u q Q q =R q ; n=n*eR s ^' 

where 

Pq = Pq* = GiLr 2 K 1 L 12 G q2 bR s ^ 

Qq = Qq = G q3 AG *3 e R (3.4) 

Rq = G^ 2 L* 2 K q (L 11 +L 12 G ql )AG*3 e R 
and Aq, Gqj, Lj are defined by (1.20)- (1.24) 

mn 

These necessary conditions cannot be solved explicidy for the optimal row 
unitary matrix U q and so an iterative procedure is required. One possible algo- 
rithm is now described. 

Recursive Algorithm for Optimal q-FSWL Markov COVER: 

(0) Set j = 0 and choose any row unitary U q (0) in (1.21a) 

(1) Form Aq(j) from 

A q (j) = (Ln + L 12 G ql ) + Li 2 Gq 2 U q (j)G q 3 (3.5a) 

(2) Compute KqO): K x 0) = A^O)KqO)AqO) + C q *C q (3.5b) 

(3) Compute AO): -AQ) = Aq*(j)A(j)A q (j) + K^(j); A(j) = A*©.5c) 

(4) Compute P q (j),Q q 0)R q (j): 

Pq(j) = Gq 2 Lj* 2 Kq(j)L 12 G q2 ; Qq(j) = G q3 A(j)G q 3J 

RqO) = G q2 L 12 Kq(j)(Li i+L 12 G ql )A(j)G q 3 (3.5d) 

(5) Update U q Q) by solving the nonlinear algebra problem: 

fl(j)U q (j+l) - P q (j)Uq(j+l)Qq(j) = RqO); «(j) = 0*0) (3.5e) 

UqO+l)U q Vl) = I 

The most difficult step at each stage of the algorithm is to solve (3.5e) for a row 
unitary U q 0+1) and symmetric GO)- There is generally no explicit solution 



